cm0002@lemmy.world · English · 10 days ago · The Attention Mechanism Born for Cost Optimization · oilbeater.com
cm0002@lemmy.world · English · 11 days ago · dcdaML - devanagari character detection dataset training framework · github.com
cm0002@lemmy.world · English · 18 days ago · Neural Graffiti is an experiment in adding a "Spray Layer" to a transformer model, which injects a memory trace into the final stages of inference without finetuning or retraining · github.com
kerntucky@infosec.pub · English · 2 months ago · Malicious ML models found on Hugging Face Hub · www.helpnetsecurity.com
Charlie Fish@eventfrontier.com · English · 3 months ago · Very inconsistent machine learning model training
Charlie Fish@eventfrontier.com · English · 6 months ago · coremltools Error: ValueError: perm should have the same length as rank(x): 3 != 2
Charlie Fish@eventfrontier.com · English · 6 months ago · TensorFlow Lemmy Community · eventfrontier.com
Shamar@feddit.it · English · 6 months ago · A community statement supporting the Open Source Definition (OSD) · osd.fyi
tomjuggler@lemmy.world · English · 1 year ago · Alternative to Generating images: get AI to generate query for real image (Unsplash)
keepthepace@slrpnk.net · English · 1 year ago · Who else here loves the end-to-end robotics models that seem to go out on a weekly basis? · twitter.com
taaz@biglemmowski.win · English · 1 year ago (edited) · Model Design Theory Tips/Tricks/Docs (for a card game agent)
spaduf@slrpnk.net · English · 1 year ago · Transformer-Based Large Language Models Are Not General Learners: A Universal Circuit Perspective · openreview.net
filister@lemmy.world · English · 1 year ago · GPU Recommendation
ylai@lemmy.ml · English · 1 year ago · Understanding GPU Memory 2: Finding and Removing Reference Cycles · pytorch.org
tomjuggler@lemmy.world · English · 1 year ago · I hired a pirate to take orders for my entertainment business - Circus Scientist · www.circusscientist.com
spaduf@slrpnk.net · English · 1 year ago · Theoretical Foundations of Graph Neural Networks - Seminar · www.youtube.com
spaduf@slrpnk.net · English · 1 year ago (edited) · Full MIT Lectures on Machine Learning in Genomics · www.youtube.com
Wilshire@lemmy.world · English · 2 years ago · Training AI to Play Pokemon with Reinforcement Learning · youtu.be
LoveOxygenProducers@lemmy.world · English · 2 years ago · [R] Unraveling the Mysteries: Why is AdamW Often Superior to Adam+L2 in Practice?
abhi9u@lemmy.world · English · 2 years ago · An Analysis of DeepMind's 'Language Modeling Is Compression' Paper · codeconfessions.substack.com