Is anyone actually surprised by this?

  • AbouBenAdhem@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    2 days ago

    Anyone using DeepSeek as a service the same way proprietary LLMs like ChatGPT are used is missing the point. The game-changer isn’t that the Chinese company DeepSeek can compete with OpenAI and its ilk as an AI service provider—it’s that now any organization with a few million dollars to train and host their own model can compete with OpenAI.

    • naeap@sopuli.xyz
      link
      fedilink
      arrow-up
      0
      ·
      2 days ago

      I’d like to look into that, how can I train an existing model further?

      I’m only playing around with ollama, but like to do a bit more - mostly just to fulfill my needs to understand things - but have no idea where to start

        • naeap@sopuli.xyz
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          2 days ago

          Python is not a problem
          SW Dev is my job. Just never had real contact with AI before, besides playing around a bit.

          Thank you very much for the link!!

          Edit: thank you very much again, that was pretty much exactly what I was looking for.
          Don’t know how I missed to checkout huggingface. Thought of it always just as a github for models and didn’t bother checking for docs…
          But that’s a great intro with simple tools/tutorials to get a grip on it, thanks!

      • mac@lemm.ee
        link
        fedilink
        arrow-up
        0
        ·
        edit-2
        1 day ago

        Onprem has always been cheaper. Cloud compute was the most successful marketing campaign I can think of.