• 1 Post
  • 23 Comments
Joined 10 months ago
cake
Cake day: March 22nd, 2024

help-circle



  • My friend, the Chinese have been releasing amazing models all last year, it just didn’t make headlines.

    Tencent’s Hunyuan Video is incredible. Alibabas Qwen is still a go to local model. I’ve used InternLM pretty regularly… Heck, Yi 32B was awesome in 2023, as the first decent long context local model.

    …The Janus models are actually kind of meh, unless you’re captioning images, and FLUX/Hunyuan Video is still king in diffusion world.



  • As implied above, the raw format fed to/outputed from Deepseek R1 is:

    <|begin▁of▁sentence|>{system_prompt}<|User|>{prompt}<|Assistant|><think>The model rambles on to itself here, “thinking” before answering</think>The actual answer goes here.

    It’s not a secret architecture, theres no window into its internal state. This is just a regular model trained to give internal monologues before the “real” answer.

    The point I’m making is that the monologue is totally dependent on the system prompt, the user prompt, and honestly, a “randomness” factor. Its not actually a good window into the LLM’s internal “thinking,” you’d want to look at specific tests and logit spreads for that.


  • Zero context to this…

    My experience with Deepseek R1 is that it’s quite “unbound” by itself, but the chat UI (and maybe the API? Not 100% sure about that) does seem to be more aligned.

    All the open Chinese LLMs (Alibiaba’s Qwen, Tencent, InternLM, Yi, GLM) have been like this, rambling on about Tiananmen Square as much as they can, especially if you ask in English. The Chinese tech devs seem to like “having their cake and eating it,” complying with the govt through the most publicly visible portals while letting the model rip underneath.

    Contrast this with OpenAI’s approach of opaquly censoring the model, probably at the weights level, which neuters its intelligence and prose even in other tasks. Oh, and keeping every single detail closed and proprietary.



  • brucethemoose@lemmy.worldtoScience Memes@mander.xyzWobble Wobble
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    7 days ago

    Honestly we are way past the point of any scientific reasoning. The public has voted that they are uninterested, and the US government and large corporations are about to be uninterested too.

    To be blunt… No one ever really cared, but the world kinda squeaked by putting scientists in front of statesmen and public broadcasts. Everyone kinda nodded along, and not just for global warming.

    That period is over.



  • Both these seem awesome. But I have two controversial opinions on this.

    First, I think there are way too many “lone mod hero communities” when there are others trying to do almost the same thing, and I think there should be more collaboration? Like maybe y’all should repost in others subs, selectively, or something.

    Apologies if that seems offensive, but I am very touchy about this. I live in a world where too many coding projects, open source and corporate, just reinvent the wheel either because they didn’t spot the other project… Or ignore each other for other reasons. The open source space is in desperate need of more integration, and that extends to federated social media.

    Second, on your sub’s rules @jordanlund, I have many objections to YouTube, but there are many creators on there who (to me) absolutely qualify as verified news sources with all the evidence/citations they show, more than some major websites. Some delivering news you’ll find nowhere else simply because it isn’t posted in text format. Hence I have… mixed feelings banning YouTube as a source, as I do understand the need to keep the funk away.


  • The real excuse from Elons fans is “It’s just a joke.” Or “He’s trolling libs.”

    Which is almost as bad, even if true.

    It reminds me of school bullies that would make cutting, abusive “jokes” and follow it up with “Just kidding, bruh” if it doesn’t land right. And these are some of the worst human beings I have ever personally encountered.

    How can people worship something like that at such scale? Like, I wouldn’t even wish that on the most raging Nazi, it’s worse than breaking their jaw.


  • I mean, there’s a real issue.

    Say you were china, or the EU, or any other country/bloc and basically your entire youth was addicted to Twitter, Facebook or whatever, and officially manipulatable by the US government…. And you got into a real conflict. Maybe even a hot war.

    Wouldn’t you be worried about the US propagandizing your population?

    I would.

    The US government’s solution is completely dysfunctional and not getting at the root of the issue because they are afraid of reducing the power projection of big tech, among other things. But the core issue doesn’t need to be trivialized.