Skip Navigation
Posts
39
Comments
220
Joined
2 yr. ago
  • Anytime. Thank you for reaching out.

  • The admin of startrek.website reached out to me - guessing because of this thread? And seems responsive enough so I’ve removed the block and will work with them on any new issues that arise. Apologies for the trouble.

  • I’m certainly willing to give startrek another shot - the reports about them were getting overwhelming at one point, which led me to blocking the instance.

  • I think about this sometimes…

  • Can you send me the ip address for a device that is not working?

  • I’ll take a look. Apologies for the trouble

  • The problem is related to a periodic denial of service caused by lots of hits to the login page, which is an extremely expensive operation and exhausts php processes for other things like what you’re seeing.

  • I will investigate tomorrow. My apologies.

  • I’ll raise it with the mbin team. My apologies

  • As far as I can tell, it’s a use case that mbin isn’t handling. I’ll open a GitHub issue when I have some time.

  • Automod?

  • That one is written for lemmy, and there is no real compatibility between mbin and lemmy. I’m not aware of such a thing for mbin, unfortunately. (Note: I also have a lemmy instance and it doesn’t even work that well there)

  • I'm working on it. fediseer is complaining that it can't communicate with fedia.io so trying to sort that out

  • Microblogs are basically all the regular fediverse posts (from mastodon et al) that are tagged with the hashtag(s) associate with the magazine.

    If you hit the accept button on a report, it will remove the thing that was reported (post, thread, comment, etc), and if you hit reject, it just closes the report.

  • Btw, Fedia.io is running on a latest gen AMD Epyc with 96 CPUs, an nvme ssd, and 256gb of ddr5 exc memory. And Fedia is the only thing running. So it’s not a capacity issue, but rather some challenges with the code we keep bumping into because (I think) we are pushing it harder than other instances.

  • Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia federation woes

    My apologies for the recent spate of problems. I think I’ve narrowed the problem down to the /m/fediverse and /m/random magazines. For some reason, mbin is generating an enormous amount of outbound delivery messages for these two magazines. I first tried removing the hashtags from /m/fedivese, but that was only a quick fix. So I deleted the magazine. (Note, the notifications appear to be related to the “microblog” function, and were originating from accounts on lots of mastodon instances, so I think there is a bug somewhere).

    I noticed /m/random doing something similar. I have removed all the subscribers from that magazine to try to reduce the number of notifications it is sending. I don’t know if that will help - I have a feeling the instance can’t keep up with that happening in both random and fediverse.

    Anyhow, the queues are draining fast now. I purged about 600000 queued delivery messages that (based on a random sample) all appeared to be associated with fediverse and rando

  • I think there is a lag - I see this post reflected on lemmy.world: https://lemmy.world/c/FloatingIsFun@fedia.io?dataType=Post&sort=New

  • What websites are similar to brandcolors.net, but for video game UI colors?

  • Apologies - i kicked it and it's working again :(

  • I'm not seeing any issues - can you give me some examples?

  • Rabbitmq was b0rked. I fixed it a few hours ago. Let me know if you see more issues.

  • Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia.io downtime - Saturday, December 14

    The following instances will be offline briefly on Saturday, December 14 from 9am ET / 2pm UTC for approxmately 10 minutes: infosec.exchange infosec.town infosec.pub pixel.infosec.exchange books.infosec.exchange matrix/element.infosec.exchange relay.infosec.exchange meetup.infosec.exchange video.infosec.exchange infosec.press infosec.place fedia.io fedia.social elk,.infosec.exchange infosec.space convo.casa

    The servers supporting these instances require a reboot. The Dell servers these instances run on take a very long time to boot, so I am estimating 10 minutes of downtime. It could be more, could be less.

    We use live patches to minimize reboots needed for patching, however Ubuntu only provides livepatch support for a year, which is how long most of these systems have been running for.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Federation is broke again.

    It’s been a long day. I will fix it when I am back in front of a computer. It might be a few hours. My apologies.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    After a much needed break for a few days, fedia.io seems to be back to having rabbitmq issues that result in federation pausing.

    I have sort of given up in fixing the problem, and will instead work on auto-detecting and auto-recovering when the problem happens.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    .io TLD going away?

    I just saw this: https://every.to/p/the-disappearance-of-an-internet-domain

    I have no idea if it's real, but if it is, that will be most unfortunate

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia.io should be stable now

    After I resolved the federation issue, I had to clean up a few things and so the site may have been unavailable for a bit. I'm done fussing with it and will keep an eye on it to make sure things are working.

    IF YOU SEE PROBLEMS - please let me know. As far as I know, I've fixed all of the federation and error 500 issues we've had, so please don't assume it's just more of the same if you see them.

    Thanks for your patience.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Good news: the federation problem on fedia.io, as well as the problem with getting an error 500 when submitting a thread, have been solved.

    Fedia.io is sort of like she Ship of Theseus right now - I literally replaced nearly everything trying to get it back working.

    The problem ended up being a silent out of memory error that php-fpm was running into. I had to increase the memory limit to about 10x what the docs require to get it to work, but once I did that, it works great.

    I was only able to sort this out after @bentigorlich recommended I move the site to debug mode (which requires me to lock everyone else out). Once I did that, it started giving some useful errors.

    My apologies for the amount of time it took to fix this. I learned a lot about php today.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Hi all. As some of you have reported, outbound federation to at least some other instances is broken from fedia.io. At the moment. I don't know why and I don't have any leads as there are no logs or other indications of what is going wrong, but I am working on it.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia.io federation woes

    Hi all. Several of you have reported problems with fedia.io not federating with other instances correctly.

    The cause is that rabbitmq crashed, but not all the way. It crashed to the point where new connections would timeout, but the service was still running such that it wouldn't auto restart. I will be creating some automation to detect that proactively and restart rabbitmq if/when it happens again.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    We made some changes a few minute ago that we hope fixes the problem. There may be some other lingering issues, but I am hoping the voting problem is fixed now. Let me know below if you continue to see that problem.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Until I implement a better system to screen out spammers, I will be closing registrations on Fedia.io. That’s not what I want - I’d like for it to be available for legitimate accounts, but the spam is off the hook.

    Anyone seeing this can send me an email (jerry@infosec.exchange) and I’ll get an account created for you in the mean time.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia.io now behind Fastly CDN

    Hello everyone. Today, I moved fedia.io behind the Fastly CDN. This should make the site consistently fast for everyone, no matter where you are in the world. It'll also help with bandwidth usage and mitigate DDoS attacks.

    There were a few hiccups as I set that up today - my apologies if you saw errors or broken images for a bit.
    EDIT: I previously said that this was the first time mbin or kbin was put behind a CDN. That is incorrect. kbin.earth has been behind Cloudflare. Apologies.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia.io instability

    Hi all. I've been having some problems keeping fedia.io running - at the moment, either the message workers or the php web server processes are dying after an hour or so and I have to restart everything. I have been working with the mbin team and installed some updates that we hoped would fix the problems, but no luck. I am going to work on a cron job to automatically restart things once an hour. The down side, is that you'll likely see some error 500's if you happen to hit it when the processes are restarting, but it should happen quickly and refreshing the page should make it work again.

    test @infosec.pub
    jerry @fedia.io

    second mbin to lemmy upload test

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Unplanned downtime

    Shortly after upgrading to Mbin 1.7.1-rc1, php ran out of workers. I dramatically increased the limit. It isn’t clear to me why that happened and if it’s related to the upgrade or just coincidental. My intuition is that it’s related, but I have no evidence.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia.io upgraded to mbin 1.7.1-rc1 + important note about streaming updates

    Hello everyone. I just upgraded fedia.io to mbin 1.7.1-rc1. One of the notable changes is that mbin is deprecating mercure, which is the component that provided streaming updates. As such, you will have to refresh the web page to see new posts and comments.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    The (relatively new) server that Fedia.io was running on, a Hetzner AX 162-R, died overnight. Hetzner tells me that the main board failed and had to be replaced. In the process of repairing, the raid set got corrupted and would no longer boot.

    Every single AX 162 (R or M) I’ve rented from Hetzner has failed now at least once. This was the last one I had. It was on my to do list to move fedia.io to a Dell server with the same specs. I knew this was going to happen, but I didn’t get it done in time.

    For those of you who have been following along, Fedia has been cursed from the beginning. The kbin software was a god damned disaster, and very fortunately the mbin team spent an incredible amount of time and patience to help me sort out the many problems, nearly all of which are fixed now.

    Except for the random occurrences where federation breaks due to an as-yet-unknown bug, the main stability issue has been hardware. I have had excellent luck with Hetzner’s Dell servers, so I a

    Fedia Discussions @fedia.io
    jerry @fedia.io

    I will be rehoming fedia.io to a less expensive server the afternoon of July 1 - exact timing is TBD. Downtime should last about 2 hours. The current server is quite expensive and donations are dwindling, which is normally ok, but I am losing my job and have to be a bit more frugal.

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Yesterday, the fedia.io server locked up. I was able to reboot it remotely and it came up clean. After less than an hour, the server froze again. This happened several more times throughout the day. Unfortunately, there were no logs recording what happened, and nothing on the console - just frozen hardware.

    I contacted Hetzner early this morning and they diagnosed the server as having a faulty motherboard. Hetzner replaced the board and rebooted the server, and so far the server has been stable.

    I have had pretty bad luck with this particular model of server from Hetzner, so I do not have confidence that this won't happen again, and so will be looking to migrate to a different type of server that is hopefully more stable and less expensive (I am losing my job at the end of June, and so need to save all the cash I can).

    Fedia Discussions @fedia.io
    jerry @fedia.io

    Fedia stablility update

    Fedia.io had a few issues over the past 24 hours - sometimes working find till you click on certain posts, which result in an error 500, and other times just getting an error 500 no matter what.

    The first issue I found is that amqproxy, which helps to reduce the load on the server between the queue runners that process incoming and outgoing posts and rabbitmq. I found this morning that amqproxy was consistently failing, despite there being no apparent problem. I bypassed amqproxy, since the server can handle the load fine without amqproxy. That seemed to work and things returned to normal. A few hours later, the site started responding with error 500 to nearly all requests. This happened because the database server ran our of connections. The 300 it was set to should have been plenty, but clearly it was not. I've set that to 3000 and so far, so good.

    My apologies for the instability. I continue to learn the nuances here and will keep making the service more reliable as