On Algorithms: Hexbear vs Lemmy

RedWizard [he/him, comrade/them]@hexbear.net · 4 days ago

dead [he/him]@hexbear.net · edit-2 4 days ago

I love the graphs. I love the enthusiasm you have.

I see a problem with your methodology. You are comparing the algorithms on posts that start with 1000 upvotes and never change. Posts start with 1 upvote and the number of upvotes grows based on visibility in the algorithm. The algorithm used influences the number of upvotes that a post will get.

A post with 0 comments cannot reach 1000 upvotes. Comparing the decay of 2 posts with the same number of upvotes does not make sense because the algorithm also changes the upvotes that a post will get. The lack of comments causes a lack of visibility, which causes a diminishing upvote rate, and it all compounds. A faster decay means that a post with no comments loses visibility even sooner.

From my experience, posts are primarily viewed in 2 ways, on ‘new’ sort and on ‘active’ sort. The front page of ‘new’ sort lasts around 1 hour typically and then users won’t upvote the post any more from ‘new’ sort. After 1 hour, the post only gains upvotes if it is visible in the ‘active’ algorithm. A post with 0 comments never even sees the front page of ‘active’ sort, therefore stops gaining upvotes at all after 1 hour. To compare the algorithms accurately, you would need to have a new function where a post only gains upvotes when the post is visible on the algorithm front page after the first hour.
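A minimal sketch of the kind of visibility-gated simulation described above. Everything here is an assumption for illustration: the `hot_rank` shape (only the 1.8 exponent and the logarithmic score scaling are mentioned in this thread), the front page size, the one-hour ‘new’ window, and the fixed vote gain per visible hour.

```python
import math


def hot_rank(score: int, age_hours: float, gravity: float = 1.8) -> float:
    """Assumed hot-rank shape: log-scaled score divided by a power of age."""
    return 10000 * math.log10(max(1, score + 3)) / (age_hours + 2) ** gravity


def simulate(posts, hours, front_page_size=1, new_window_hours=1, votes_per_hour=5):
    """Advance the feed hour by hour; posts gain votes only while visible.

    posts is a list of dicts with 'score' and 'age' (hours). A post counts as
    visible if it is still inside the 'new' window or on the ranked front page.
    """
    posts = [dict(p) for p in posts]  # work on copies, leave the input alone
    for _ in range(hours):
        ranked = sorted(posts, key=lambda p: hot_rank(p["score"], p["age"]), reverse=True)
        front_page = {id(p) for p in ranked[:front_page_size]}
        for p in posts:
            on_new = p["age"] < new_window_hours
            if on_new or id(p) in front_page:
                p["score"] += votes_per_hour
            p["age"] += 1
    return posts


result = simulate([{"score": 50, "age": 0}, {"score": 1, "age": 0}], hours=5)
print(result)
```

With these made-up numbers, the second post only gains votes during its first hour on ‘new’ and then stalls, while the first keeps growing because it holds the single front page slot.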

I think it’s ineffectual to try to cull controversial threads using an algorithm. This purpose was originally served by downvotes. This function can likely only be replaced by active moderators, ie locking posts with excessive arguments.

The decay being exponential seems unnecessarily aggressive. Why can’t the decay be linear and still reach a value of 0 after 24 hours?

example: SCORE * ((24 - X) / 24), where X is the age of the post in hours.
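As a rough sketch, the linear formula above can be put next to a power-law decay for comparison. The `power_decay` shape, including the `+ 2` offset, is an assumption about how the current decay works; only the 1.8 exponent comes up in this thread.

```python
def linear_decay(score: float, age_hours: float, lifetime_hours: float = 24.0) -> float:
    """Linear decay from the example above: SCORE * ((24 - X) / 24), floored at 0."""
    remaining = max(0.0, lifetime_hours - age_hours)
    return score * remaining / lifetime_hours


def power_decay(score: float, age_hours: float, gravity: float = 1.8) -> float:
    """Assumed power-law decay: score / (age + 2) ** gravity."""
    return score / (age_hours + 2) ** gravity


# Print the fraction of the starting value that survives at each age.
for age in (0, 1, 2, 6, 12, 24):
    linear_left = linear_decay(100, age) / linear_decay(100, 0)
    power_left = power_decay(100, age) / power_decay(100, 0)
    print(f"{age:>2}h  linear={linear_left:.2f}  power={power_left:.2f}")
```

Under these assumptions the power-law curve sheds most of its value in the first couple of hours, while the linear one loses the same amount every hour until it hits 0 at 24 hours.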

Editing my post because I want to be precise and I want to be respectful, but some of my thoughts feel abstract. My complaint is that posts with important news are dying after 1-2 hours. My speculation is that exponential decay increases the chances that important news stories are disappeared into the void within 1-2 hours. Whether hexbear’s algorithm is giving a lower hypothetical score to controversial posts is a tangent.

RedWizard [he/him, comrade/them]@hexbear.net · 4 days ago

The second graph starts with 1 vote (the default), and each post gains anywhere between 0 and 5 upvotes at random per hour. I could build a more robust simulation that makes a post get more votes the higher its rank is, but in testing I learned that rank in both algorithms is demonstrably impacted by the number of hours since posting, far more than by score. This is because the reward for a high score is logarithmically scaled.
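A minimal sketch of that setup, for anyone who wants to reproduce it. The `hot_rank` shape below is an assumption (this thread only confirms the 1.8 decay exponent and the logarithmic score scaling), and the seeded random generator is just there to make runs repeatable.

```python
import math
import random


def hot_rank(score: int, age_hours: float, gravity: float = 1.8) -> float:
    """Assumed hot-rank shape: log-scaled score divided by a power of age."""
    return 10000 * math.log10(max(1, score + 3)) / (age_hours + 2) ** gravity


def simulate_post(hours: int, seed: int = 0) -> list[tuple[int, int, float]]:
    """Return (hour, score, rank) rows for one post gaining 0-5 votes per hour."""
    rng = random.Random(seed)  # seeded so the run is repeatable
    score = 1  # posts start with 1 vote
    rows = []
    for hour in range(hours + 1):
        rows.append((hour, score, hot_rank(score, hour)))
        score += rng.randint(0, 5)  # random hourly gain, inclusive of 0 and 5
    return rows


for hour, score, rank in simulate_post(24):
    print(hour, score, round(rank, 1))

# Age outweighs score: a 12-hour-old post with 1000 votes ranks below
# a 1-hour-old post with 10 votes under this assumed formula.
print(hot_rank(1000, 12) < hot_rank(10, 1))
```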

The Hexbear algorithm gives posts that get comments in the first couple of hours a sizable boost, but that boost diminishes over time, so even a thread with sustained conversation still tapers off. Whereas in the default, the boost is the same for two days and then violently ends.

The benefit is that a post will slip down the ranks as time passes and not dominate the front page.

The flaw as you point out is that threads with no comments die within hours. That is true for either algorithm. If something feels important, people should comment on it.

That decay is part of the standard Hot_Rank algorithm that is used in both. The base decay could be too aggressive. Any changes made to that will result in a more stale feed, as all posts take longer to drop off.

One way this could be combatted is by doubling the score of a post with no comments or even changing the decay from 1.8 to something lower, such as 1.4 so it decays slower until it gets a comment.
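A quick sketch of both ideas, to make them concrete. The `hot_rank` shape and the `rank_with_no_comment_relief` helper are hypothetical; only the 1.8 and 1.4 exponents and the idea of doubling the score come from the paragraph above.

```python
import math


def hot_rank(score: int, age_hours: float, gravity: float = 1.8) -> float:
    """Assumed hot-rank shape: log-scaled score divided by a power of age."""
    return 10000 * math.log10(max(1, score + 3)) / (age_hours + 2) ** gravity


def rank_with_no_comment_relief(score: int, age_hours: float, comments: int,
                                mode: str = "gravity") -> float:
    """Hypothetical tweak: go easier on a post until it gets its first comment."""
    if comments > 0:
        return hot_rank(score, age_hours)  # normal behaviour once commented
    if mode == "double":
        return hot_rank(score * 2, age_hours)  # option 1: double the score
    return hot_rank(score, age_hours, gravity=1.4)  # option 2: slower decay


for age in (1, 3, 6, 12):
    print(age,
          round(hot_rank(50, age)),
          round(rank_with_no_comment_relief(50, age, 0, mode="double")),
          round(rank_with_no_comment_relief(50, age, 0, mode="gravity")))
```

Because the score is log-scaled in this assumed formula, doubling it moves the rank far less than lowering the exponent does, and the gap widens as the post ages.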

This is why I want to build a simulation engine that can generate a feed of posts, with growing upvotes and growing comment counts.

I’m not convinced my methods here are perfect. But I think my general conclusions seem true. Comments are very important. In the other conversations here you’ll see two reasons given for the current algorithm: minimizing struggle sessions and ensuring the mutual aid comm isn’t totally washed out.

There could be other ways to handle this, such as explicitly making the posts from the mutual aid community rank higher or decay slower or both. But it requires more testing.

Sphere [he/him, they/them]@hexbear.net · 4 days ago

I was the one who reimplemented the old Hexbear sort algorithm after we merged back to upstream Lemmy and began federating. (I did not come up with the algorithm; that was someone else. I just took it from the old codebase and modified the newer codebase to do the same thing.)

I agree with your analysis; one of my goals in making the change was to stop struggle sessions from lasting two days straight, as they had begun doing under the default Lemmy Active sort.

As for @dead@hexbear.net’s complaints, it’s true that posts need comments to survive under the new algorithm, but that is even more true in default Lemmy Active sort, which simply replaces the posted_time as used in Hot sort with the last_comment_time_necro (_necro because it stops being updated after 48 hrs, hence the cutoff where posts finally drop off the feed). So a highly-upvoted thread that only has one person commenting on it every so often will stay right at the top of the feed.
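To illustrate the mechanism described above, here is a rough Python sketch (not the actual Lemmy SQL or Rust). The `hot_rank` shape is an assumption, times are plain hour offsets for simplicity, and `last_bump_hour` is a hypothetical stand-in for `last_comment_time_necro`.

```python
import math

NECRO_CUTOFF_HOURS = 48  # per the comment above: bumps stop counting after 48 hrs


def hot_rank(score: int, hours_since: float, gravity: float = 1.8) -> float:
    """Assumed hot-rank shape: log-scaled score divided by a power of age."""
    return 10000 * math.log10(max(1, score + 3)) / (hours_since + 2) ** gravity


def last_bump_hour(posted_at: float, comment_hours: list[float], now: float) -> float:
    """Latest comment time that still counts, falling back to the post time."""
    cutoff = posted_at + NECRO_CUTOFF_HOURS
    counted = [t for t in comment_hours if t <= now and t <= cutoff]
    return max(counted, default=posted_at)


def active_rank(score: int, posted_at: float, comment_hours: list[float], now: float) -> float:
    """Hot rank, but measured from the last counted comment instead of the post time."""
    return hot_rank(score, now - last_bump_hour(posted_at, comment_hours, now))


# One comment every so often keeps a 47-hour-old thread ranked like a fresh one...
print(round(active_rank(100, 0, [10, 30, 47], now=47.5)))
# ...until the cutoff, after which new comments no longer bump it.
print(round(active_rank(100, 0, [10, 30, 47, 60], now=61)))
```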

And yes, the bump comments are an effort to abuse the algorithm, it’s true. That’s pretty much exclusively done in the mutual aid comm though, and I don’t think anyone is truly opposed to that here.

It’s also worth noting that those bots do not make any difference; the bump comment itself does the bumping, and the bot comments, which happen only seconds later, do basically nothing to further boost the post. When I get around to coding a bot that will wait 30 mins to respond, I hope we can ban those other bots from the mutual aid comm, so that threads don’t get cluttered with useless garbage comments that don’t even do anything useful.

Anyway, I don’t think either algorithm will fix the problem of important posts getting lots of upvotes but no comments; both algorithms will send such a post down the feed rapidly, unfortunately. So, if you think a thread is important and want it to stay on the feed, you’ve gotta make a comment on it of some sort.

RedWizard [he/him, comrade/them]@hexbear.net · edit-2 4 days ago

Yeah, that’s my assessment as well, and the graphs really illustrate that. Just so I understand as well: default Lemmy looks like it uses the same rank calculation (hot rank) but simply passes the most recent comment’s timestamp to produce the “active” sort, and in Hexbear we implement a separate hot_rank_active function and change the part of the code that defines the Active sort to use this function instead, correct? Is it also possible that we could implement both? Default Active and Hexbear Active, or does that create a breaking change? (I imagine it does).

I’m not sure that struggle sessions are really being mitigated by the current algorithm if I’m being honest. If anything, it suggests to me that it encourages new posts regarding the struggle session since the old posts will fall off the front page, and new posts on the topic will generate many votes and comments, driving them up the feed.

It would be interesting if there was a way to boost threads with no comments, and then drive threads down when they generate “too many comments”. I have this metric that I go by when looking at threads on Reddit, which is, if the Score is Less Than the Number of Comments, then it’s clearly a controversial thread. I guess you could express that as controversy_ratio = (comments + 1) / (score + 1), which should give you how controversial a thread is.

10 comments and 100 post score ≈ 0.1
50 comments and 100 post score ≈ 0.5
100 comments and 100 post score = 1
150 comments and 100 post score ≈ 1.5
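The ratio is simple enough to write out directly. A small sketch, with the exact values the `+ 1` terms produce, which differ slightly from the plain comments/score ratio:

```python
def controversy_ratio(comments: int, score: int) -> float:
    """(comments + 1) / (score + 1); the +1 terms avoid dividing by zero."""
    return (comments + 1) / (score + 1)


for comments in (10, 50, 100, 150):
    print(comments, round(controversy_ratio(comments, 100), 3))
# 10 0.109
# 50 0.505
# 100 1.0
# 150 1.495
```

Anything above 1 means the thread has more comments than score, which is the “controversial” case.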

I’m sure you could use this ratio to create a time penalty when calculating its rank. I’m not sure exactly how one would do that, I feel like the math is a little beyond me, and for all I know it might simply create a similar curve and not really result in different behavior. I’d need some kind of simulation that allows me to test these ideas against a feed of posts, where I can control the number of comments a thread gets over time, etc.

Sphere [he/him, they/them]@hexbear.net · 4 days ago

Your understanding is correct, yes.

Including both sorting algorithms (or a novel one in addition to the existing one) would definitely complicate the picture, though I’m not really sure how significantly. What I did was fairly simple, whereas adding a new sorting option would require changes to the UI code in addition to back-end changes like I made. It’s not impossible, but it would require more work, and would increase the odds of updates to upstream Lemmy requiring more work to merge in (if they change something in any of the parts of the code we’ve modified, there will be a merge conflict that has to be resolved).

Regarding struggle sessions, there’s no true solution, but I think having the thread fall off the feed fairly soon helps to mitigate them to at least some degree. I don’t have any data to back that up, though, so I could be wrong.

As for implementing a novel algorithm as a replacement to existing sort, I’m no better equipped than you to devise one–in fact, your graphs make clear that you’re more prepared to analyze algorithmic effects than I am–I just plugged a few values in to get a sense of what it was doing at various times after the post was made.

If you’re interested in working on something like that, it could be implemented on test.hexbear.net to see what effects it would have (if we refresh the data every so often, we can get at least some sense of how it affects post rankings). If you are, I suggest reaching out to an admin to get an invite to the dev chat. (I’m unfortunately not likely to have any time for working on Hexbear code anytime soon, though. Plus I don’t really know much of anything about Rust anyway.)

RedWizard [he/him, comrade/them]@hexbear.net · 4 days ago

Ah, so not so much a breaking change as one that requires integration. Def a lot of work, and it wouldn’t be supported by 3rd party apps.

As for my aptitude, I’ve played around with numpy and pandas for other projects in the past. I’m not 100% sure if I graphed things accurately. I’m probably going to try and build a simulation that uses actual datetimes and implements the algorithms unchanged.

I’m curious, in the function you wrote, where did you come up with the 0.000012146493725346809 value? Is it simply arbitrary or is it something specific?

Sphere [he/him, they/them]@hexbear.net · edit-2 4 days ago

I didn’t write it; that was someone else back in the earliest days of Hexbear, back when it was still chapo.chat. I just took the SQL from the old codebase and ported it into the new one in the appropriate way to make it work.

As far as I know, it is arbitrary, though.

Sphere [he/him, they/them]@hexbear.net · 4 days ago

Made a few edits to the above comment after rereading your post.