I was able to fine-tune GPT-2 on a 3090 without problems, but I suspect anything of real utility will require much larger hardware. Perhaps it's worth taking StarCoder and self-hosting it without any tuning instead?
# Sep 10, 2023 18:54
# May 21, 2024 01:53
I don't know how much demand there is for 64-bit ops. If anything, the move is in the opposite direction, toward 16-bit floats. Even setting that aside, I'd be concerned about the market dominance of CUDA and all the specialized operators written for it. NVIDIA has a frustratingly large lead on pretty much every front.
# Sep 27, 2023 19:22