• 0 Posts
  • 9 Comments
Joined 6 months ago
cake
Cake day: August 14th, 2024

help-circle
  • I will also quickly cancel the student visas of all Hamas sympathizers on college campuses, which have been infested with radicalism like never before.

    Wow, nothing says freedom of speech like telling folks to not expect it if they become a citizen. Like typically you put on your best on for guest, but I guess the United States is now at the “get the fuck out” stage of encouraging young engineers to the country.

    …Except if you’re willing to be subjected to a work visa.

    I mean it’s honestly telling everyone who we really are here. We don’t want you unless you’re willing to be a wage slave. I mean President is free to do that, but damn that’s really sending a message there.


  • Copenhagen-hosted DistroWatch says it has tried to appeal against the Community Standards-triggered ban. However, they say that a Facebook representative said that Linux topics would remain on the cybersecurity filter.

    Nope, this one isn’t ignorance, it’s actual malice. They fully intended to start blocking Linux topics.

    When you take this and pair it with what Larry Ellison just recently said:

    AI will ensure “citizens will be on their best behavior”

    There tends to be a pattern forming that I really don’t want to draw because I like tinfoil on my head.


  • So is the rout based on the idea that the need for training hardware is much smaller than suspected even if the operation cost is the same… or is the stock market just clueless and dumb and they’re all running on vibes at all times anyway?

    Two parts here.

    1. nVidia is over valued, everyone has known this but nobody wanted to call. Someone clicking a decent model on a fraction the resources was good as anyone to call the bluff.
    2. Lots of the folks who are in it for nVidia believe that companies are going to need chips out the ass to keep up. It’s getting ahead of everyone to say “that’s no longer true”, but for reasons there’s a good chance the chip expectation isn’t as big as nVidia was painting.

    As for the model.

    This model is from China and trained there. They have an embargo on the best chips, they can’t get them. So they aren’t supposed to have the resources to produce what we’re seeing with DeepSeek, and yet, here we are. So either someone has slipped them a shipment that’s a big no-no OR we take it at face value here that they’ve found a way to optimize training.

    The neat thing about science is reproducibility. So given the paper DeepSeek wrote and the open source nature of this. Someone should be able to sit down and reproduce this in about two month (ish). If they can, nVidia is going to have a completely terrible time and the US is going to have to rethink the whole AI embargo.

    Without deep diving into this model and what it spouts, the skinny is that nVidia has their top tier AI GPUs. It has all these parts cut into the silicon that makes creating a model cost a lot less in kilowatts of power. DeepSeek says they were able to put in some optimizations that gets you a model on low kilowatts by optimizing some of the parts found only in the top tier AI GPUs.

    Blah blah example of this DeepSeek used 32 of the 132 streaming multiprocessors on their Hopper GPUs to act as a hardware accelerated communication manager and scheduler. Top tier nVidia cards for big farms do this in their hardware already in a circuit called the DPU. Basically DeepSeek found a way to use their Hopper GPUs to do the same function as nVidia’s DPUs.

    If true, it means that the hardware nVidia is popping into their top tier isn’t strictly required. It’s nice, and you’ll still get a model on less kilowatts than the tricks DeepSeek is using, but DeepSeek’s tricks means the price difference between top tier and low tier needs to be a lot closer than it is to stay competitive. As it stands with DeepSeek’s tricks (again, if they prove to be correct) is that if you’ve got a little extra time, you can get bottom tier AI GPUs and spend about the same kilowatts for what the top tier will kick out with a hint less kilowatts. The difference in cost of kilowatts between the amount you spend on low tier and amount you spend on kilowatts on top tier isn’t enough to justify the top tier’s price difference from the low tier, if time is not a factor.

    And so that brings us full circle here. If someone is able to reproduce DeepSeek’s gains, nVidia’s top tier GPUs are way over priced and their bottom tier is going to sell out like hotcakes. That’s bad for nVidia if they were hoping to, IDK, make ridiculous profit. And that is why the sudden spook in the market. I mean, don’t get me wrong, folks have been looking forward to popping nVidia’s bubble, so they’ve absolutely been hyping this whole thing up a lot more. And it didn’t help that it came top #1 on the Apple App Store.

    So some of this is those people riding the hate nVidia train. But some of it is also, well this is interesting if true. I think it’s a little early to start victory laps around nVidia’s grave. The optimizations purposed by DeepSeek have yet to be verified as accurate. And things are absolutely going to get interesting no matter the outcome. Because if the purposed optimizations don’t actually produce the kind of model DeepSeek has, where did they get it from? How did they cheat? Because then that’s an interesting question in of itself, because they aren’t supposed to have hardware that would allow them to make this. Which could mean a few top tier cards are leaking into China’s hands.

    But if it all does prove true, well, he he he, nVidia shorts are going to be eating mighty well.






  • Digital circuits. Went to college for Electromechanical engineering. Got really into digital circuits.

    Got out of college and pretty much fell into computer programming. Fast forward several decades and I finally land a job with same hours and good pay with life in a semi stable state.

    Decide to hobby my enjoyment for digital circuits. Fucking chip shortage happens and getting MCUs and GALs (among others) become harder to get.

    It’s slowly gotten better, but it had me asking if the universe didn’t want me doing digital circuits.


  • You have to understand how Vance views the issue. For him abortion isn’t a meet in the middle stance. Abortion to him and folks similar see the matter as only having one possibly correct solution.

    Thus for him, “Americans instinctively mistrust us” doesn’t mean that his position would evolve, it’s that “to him”, he’s done a “bad job” making your stance evolve.

    The hard line Republicans aren’t interested in finding common ground, they’re more interested in what you will change your opinion to or at the very least what unacceptable positions you’ll tolerate. There is never going to be an evolution or common ground to be found with these folks because that’s distinctly not the position that they are looking for.