Register a SA Forums Account here!
JOINING THE SA FORUMS WILL REMOVE THIS BIG AD, THE ANNOYING UNDERLINED ADS, AND STUPID INTERSTITIAL ADS!!!

You can: log in, read the tech support FAQ, or request your lost password. This dumb message (and those ads) will appear on every screen until you register! Get rid of this crap by registering your own SA Forums Account and joining roughly 150,000 Goons, for the one-time price of $9.95! We charge money because it costs us money per month for bills, and since we don't believe in showing ads to our users, we try to make the money back through forum registrations.
 
  • Post
  • Reply
Mescal
Jul 23, 2005

Llama 3 is getting some noise. There's a small version people are running on small hardware.
about https://ai.meta.com/blog/meta-llama-3/
try https://www.meta.ai/

Mescal fucked around with this message at 16:43 on Apr 24, 2024

Adbot
ADBOT LOVES YOU

Tarkus
Aug 27, 2000

I've been doing some basic tests and the 8B model is pretty decent, it does better in some programming and reasoning tasks than larger models like Mixtral 8x7b and Mistral Large. I think the suspicion that earlier models are over-parameterized is true. Microsoft has a 3.8B model coming out soon and they say it's good, who knows.

Hadlock
Nov 9, 2004

Yeah Microsoft is claiming just yesterday they used some novel training methods akin to reading it children's books to get gpt 3.5 performance (which at one point had 170b params, according to the Internet, although I think the turbo model had half as much) out of 3.8b params which would be... Like almost 1/44 the size, which if true :golfclap: but I'm skeptical as that would mean ChatGPT on a raspberry pi

Tarkus
Aug 27, 2000

Yeah, Microsoft had a white paper last summer starting off their PHI series where they train it using only textbook quality data. One thing I do know is that for decades Microsoft or Bill Gates has been archiving all kinds of data and trying to find permanent ways to store it for the future, I would imagine that those efforts would give them access to those kinds of things.

  • 1
  • 2
  • 3
  • 4
  • 5
  • Post
  • Reply