• 0 Posts
  • 3 Comments
Joined 1 year ago
cake
Cake day: October 4th, 2023

help-circle
  • This assumes some kind of eureka innovation, right? A 96% reduction in compute demands per “token” is revolutionary. I haven’t seen anyone yet explain what that innovation is, exactly. There is also mixed reporting on how “open source” DeekSeek is, with many claiming it’s only “open weight,” meaning people are having difficulty reproducing the creation of the model. It wouldn’t be the first time that a claim out of China were false, and I think it wise to reproduce any such claims before running around with our arms in the air.