noneabove1182@sh.itjust.worksMEnglish · 1 year agoBeginner questions threadplus-squarepinmessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareBeginner questions threadplus-squarepinnoneabove1182@sh.itjust.worksMEnglish · 1 year agomessage-square0fedilink
ikt@aussie.zoneEnglish · 2 days agoDid DeepSeek R1 just pop nvidias bubble?plus-squarewww.youtube.comexternal-linkmessage-square6fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkDid DeepSeek R1 just pop nvidias bubble?plus-squarewww.youtube.comikt@aussie.zoneEnglish · 2 days agomessage-square6fedilink
Smokeydope@lemmy.worldEnglish · edit-24 days agoWhy llms are suprisingly good at math, and what it means to process language.plus-squarelemmy.worldexternal-linkmessage-square19fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkWhy llms are suprisingly good at math, and what it means to process language.plus-squarelemmy.worldSmokeydope@lemmy.worldEnglish · edit-24 days agomessage-square19fedilink
Smokeydope@lemmy.worldEnglish · edit-25 days agoThoughts on new deepseek R1 distill modelsplus-squaremessage-squaremessage-square6fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareThoughts on new deepseek R1 distill modelsplus-squareSmokeydope@lemmy.worldEnglish · edit-25 days agomessage-square6fedilink
brokenlcd@feddit.itEnglish · 19 days agounsure on how to quantize modelplus-squaremessage-squaremessage-square2fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareunsure on how to quantize modelplus-squarebrokenlcd@feddit.itEnglish · 19 days agomessage-square2fedilink
🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕕𝕚𝕝𝕖@lemm.eeEnglish · 20 days agoHow much gpu do i need to run a 90b modelplus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareHow much gpu do i need to run a 90b modelplus-square🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕕𝕚𝕝𝕖@lemm.eeEnglish · 20 days agomessage-square0fedilink
Smokeydope@lemmy.worldEnglish · 20 days agoNvidia Digits AI Supercomputer just announcedplus-squarelemmy.worldexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkNvidia Digits AI Supercomputer just announcedplus-squarelemmy.worldSmokeydope@lemmy.worldEnglish · 20 days agomessage-square0fedilink
Halo@lemmy.worldEnglish · 26 days agoGo toolchain error - Does anyone know what's going on here? lemmy.worldexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkGo toolchain error - Does anyone know what's going on here? lemmy.worldHalo@lemmy.worldEnglish · 26 days agomessage-square0fedilink
hendrik@palaver.p3x.deEnglish · edit-21 month ago(New) papers by Meta: Large Concept Models and BLTplus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-square(New) papers by Meta: Large Concept Models and BLTplus-squarehendrik@palaver.p3x.deEnglish · edit-21 month agomessage-square0fedilink
BB84@mander.xyzEnglish · edit-21 month agoNew open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmarkplus-squarehuggingface.coexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkNew open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmarkplus-squarehuggingface.coBB84@mander.xyzEnglish · edit-21 month agomessage-square0fedilink
hok@lemmy.dbzer0.comEnglish · edit-21 month agoCan you fine-tune on localized steering of an LLM?plus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareCan you fine-tune on localized steering of an LLM?plus-squarehok@lemmy.dbzer0.comEnglish · edit-21 month agomessage-square0fedilink
sith@lemmy.zipEnglish · 2 months agoQuestions about HW for local LLM.plus-squaremessage-squaremessage-square1fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareQuestions about HW for local LLM.plus-squaresith@lemmy.zipEnglish · 2 months agomessage-square1fedilink
HumanPerson@sh.itjust.worksEnglish · edit-22 months agoFixed itplus-squaresh.itjust.worksexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkFixed itplus-squaresh.itjust.worksHumanPerson@sh.itjust.worksEnglish · edit-22 months agomessage-square0fedilink
hok@lemmy.dbzer0.comEnglish · 2 months agoLlama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?plus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareLlama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?plus-squarehok@lemmy.dbzer0.comEnglish · 2 months agomessage-square0fedilink
projectmoon@lemm.eeEnglish · edit-22 months agoOpenWebUI OpenStreetMap Tool 2.1.0plus-squareopenwebui.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkOpenWebUI OpenStreetMap Tool 2.1.0plus-squareopenwebui.comprojectmoon@lemm.eeEnglish · edit-22 months agomessage-square0fedilink
lynx@sh.itjust.worksEnglish · 2 months agoQwen2.5-Coder-7Bplus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareQwen2.5-Coder-7Bplus-squarelynx@sh.itjust.worksEnglish · 2 months agomessage-square0fedilink
Smorty [she/her]@lemmy.blahaj.zoneEnglish · 3 months agoHaving trouble to generate correct output? Try prefixes!plus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareHaving trouble to generate correct output? Try prefixes!plus-squareSmorty [she/her]@lemmy.blahaj.zoneEnglish · 3 months agomessage-square0fedilink
EffortlessOps@sh.itjust.worksEnglish · 4 months agoMeta unveils open-source Llama Stack, standardizing AI building blocks across the entire development lifecycle.plus-squaregithub.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkMeta unveils open-source Llama Stack, standardizing AI building blocks across the entire development lifecycle.plus-squaregithub.comEffortlessOps@sh.itjust.worksEnglish · 4 months agomessage-square0fedilink
brucethemoose@lemmy.worldEnglish · edit-24 months agoQwen2.5: A Party of Foundation Models!plus-squareqwenlm.github.ioexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkQwen2.5: A Party of Foundation Models!plus-squareqwenlm.github.iobrucethemoose@lemmy.worldEnglish · edit-24 months agomessage-square0fedilink
Smokeydope@lemmy.worldEnglish · 4 months agoTesting the Limits: My GTX 1070 Rig vs Mistral Small 22Bplus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareTesting the Limits: My GTX 1070 Rig vs Mistral Small 22Bplus-squareSmokeydope@lemmy.worldEnglish · 4 months agomessage-square0fedilink