Original text below this.

Following release of lemmy 0.19.18 we’ve refederated as this release should fix the bug that caused the issue.

TL;DR

We’ve temporarily defederated from Hexbear due to a Lemmy bug with very deeply nested comment threads.

A thread there triggered repeated crashes on our server, causing errors like 502 pages and “Lemmy is starting” messages. Defederating stops the issue for now.


Announcement

Due to technical issues, we’ve temporarily defederated from Hexbear until a Lemmy update is available that fixes issues with deeply nested comment chains.

There is a known bug in Lemmy (see: https://github.com/LemmyNet/lemmy/issues/6435 ) where very deeply nested comments can trigger excessive recursion during federation. When Lemmy processes these comments, it recursively fetches and verifies parent comments, which can eventually lead to stack overflows.

Under normal circumstances this happens rarely (we’ve been seeing it maybe once per day), but it becomes much more problematic when multiple new comments are added to an already deeply nested thread. Each new activity can trigger processing of the same deep chain again.

In this case, a thread on Hexbear received a large number of additional replies in a very deep comment chain.

This caused Lemmy to repeatedly process that chain, leading to stack overflows, federation worker exhaustion and timeouts. Simply put, parts of the server were crashing, too many tasks piled up at once, and requests started timing out and failing to load

You may have see this on the website with 502 errors or the lemmy error screen, and on apps it may have presented you with API timeout errors or “Lemmy is starting” errors.

For a visual representation, this graph shows the memory drop each time the server restarts:

The flat bit to the left is good, everything is fine. The choppy bit to the right, not so good, everything is not fine.

Usually its a one-off comment causing this crash, however in this case the user spent a good portion of time bumping the thread, and we had to process each one of those, each causing a crash, restarting the server, and then crashing on the next in the queue, and so on.

I did try removing the offending community from Lemmy.zip to prevent this from happening (It’s quite common behavior in that community to bump threads I think), however we still process all the activities from that community - the only certain fix for now is to defederate until a version of lemmy is released that fixes this.

The graph is back to improving now:

Hope that all makes sense!

Demigodrick

  • chemical_cutthroat@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    3 days ago

    We should really give this a while. Take a few months and make sure the fix is set before we bring them back in. Maybe even a full year.

        • forrgott@lemmy.zip
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 days ago

          Except this conversation is NOT about “the state of Lemmy”, it’s a comm specifically about this instance.

          And no, butting your head in where it clearly doesn’t belong is not appreciated.

        • NewDark@lemmy.today
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 days ago

          It’s the myth of consensual sex federation meme, but it’s hexbear, lemmy.zip saying “I consent” and this goober saying “I don’t

        • ramble81@lemmy.zip
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 days ago

          You’re on .world, your opinion doesn’t matter. I’m native here on .zip and one of the reasons I am here is because the admins don’t defederate unless it’s a technical reason like this. They leave me to act like an adult and choose which instances I want to block or not.

          Enjoy your walled garden of babysitting by the admins.

          • Cricket@lemmy.zip
            link
            fedilink
            English
            arrow-up
            0
            ·
            2 days ago

            I’m native here on .zip and one of the reasons I am here is because the admins don’t defederate unless it’s a technical reason like this. They leave me to act like an adult and choose which instances I want to block or not.

            Exactly the same here. Thank you @Demigodrick@lemmy.zip!

        • ScientifficDoggo@lemmy.zip
          link
          fedilink
          English
          arrow-up
          0
          ·
          3 days ago

          I think they meant to imply that your opinion in the matter doesn’t matter since you’re not affected AND trying to influence other people’s experience…