Register a SA Forums Account here!
JOINING THE SA FORUMS WILL REMOVE THIS BIG AD, THE ANNOYING UNDERLINED ADS, AND STUPID INTERSTITIAL ADS!!!

You can: log in, read the tech support FAQ, or request your lost password. This dumb message (and those ads) will appear on every screen until you register! Get rid of this crap by registering your own SA Forums Account and joining roughly 150,000 Goons, for the one-time price of $9.95! We charge money because it costs us money per month for bills, and since we don't believe in showing ads to our users, we try to make the money back through forum registrations.
 
  • Post
  • Reply
ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Someone whipped up their own variant of the methods used for DALL-E, so you can try it here:

https://huggingface.co/spaces/dalle-mini/dalle-mini

The results are not as impressive as DALL-E 1 or 2, but it definitely can do some creative stuff

Adbot
ADBOT LOVES YOU

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

The Alchemist posted:

With such AI algorithms the "computer, enhance" meme/trope will be real life and people in the future will see our posts mocking CSI etc. thinking we are dumb as poo poo. And they will be correct.

Not really - while AI can be used to enhance pictures so they look good at higher resolution, there is no guarantee that the enhanced picture is anywhere close to the truth of the pictured situation. Like when an AI made a pixelated Obama into a white man

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock


chill it with the cucumbers, lower right robot

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Got inspired by this one:
https://twitter.com/hardmaru/status/1525787886284849152






Hmm....that last one didn't work out so well.


Better.

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
They want to avoid enabling prompts like «Obama giving a blowjob to Soros in the Oval Office» or «Trump loving his daughter» or anything involving children, which is understandable. But since they don’t know if there are ways to bypass the filters, they also filter users do they can manage it better.

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
A challenger appears

A blue jay standing on a large basket of rainbow macarons.



A robot couple fine dining with Eiffel Tower in the background.

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
furry artists are soon out of a job

https://twitter.com/LitRPGforum/status/1528668105970536448

also browse this guy's feed, he's really churning dalle2 for all that it's worth, including telling it to make game assets

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

precision posted:

IF NOUN1 VERB NOUN2 THEN DO NOT OUTPUT NOUN2 VERB NOUN1

see, i did it

but this isn't a bunch of IF statements, it is putting the sentence into a neural network that then outputs a set of numbers, and then that set of numbers is fed into the image generator

if it was that easy, why would both openAI and google fail to fix it

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

precision posted:

imagine if they managed to prove that thoughts occur 0.0000001 seconds after the action we think they refer to, proving forever that we literally don't have any control over our actions and only think about them after they happen

they already did that, iirc

edit: https://www.nature.com/articles/news.2008.751

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

LifeSunDeath posted:

"It's Morbin' Time Jared Leto looking stupid"



"Jared Leto saying "It's Morbin Time" in a Burger King Restaurant Bathroom"








Those last two seem to just be the AI mashing up the original image. I guess once you get too specific, it recalls instead of creating

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
nice that it was trained before the movie, so we get the real morb, comic book morb

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
I'm wondering why Dall-E Mini even knows what Grogu is, because as far as I can tell two of the three datasets it uses for training does not contain him, and the third seems to only contain images and captions from before 2014.

edit: they might have updated the model after they wrote their paper, I guess - trying to find out exactly what an AI has been trained on is a bit hard, especially when I'm not versed in all the terminology

ymgve fucked around with this message at 16:00 on Jun 3, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

err posted:

have any normal people gotten dall-e 2 yet?

every person i've seen has been affiliated with the tech industry in some way

hosed up coder brain is probably not unlocking the full potential of it yet

I assume people within the tech industry are the ones most probable to use social media to show off their creations, but I wouldn't put it past them to use another AI to decide who's invited first

I hope we all get unlimited access soon, though it will probably cost money like with GPT3

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
drat it is hard getting dalle mini to work locally on windows

edit: im currently installing something called bazel to compile a library for windows to get a bugfix that isnt in the binary release of something called jaxlib, and it seems to be a full framework and has compiled 6600 files so far

ymgve fucked around with this message at 20:28 on Jun 8, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
FINALLY got dall-e mini working on my computer. This is "groverhaus in lego"



"bathtub filled with tiny planets"

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
One thing I'm really impressed with is that even Dall-E Mini is great at "knowing" reflections. This is "flooded picadilly circus":



On the other hand, it has issues with other things. This is "a blue red panda" - it is a bit sad? Maybe that's how it interpreted "blue"



And then I tried a "a small blue panda" - and just got red pandas again.

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
because I'm a sensible adult I'm now trying to make it make porn, which is pretty hard - maybe some filtering in how it interprets words

https://imgur.com/a/9sCMs7T

https://imgur.com/a/QtZbz8r - some of these are NSFW, I guess

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
not good at context



had to test the online version to see if it was better tuned than the one I run locally, but it's really hard to make it actually make something that's not just an image of the full planet

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Dall-E Mini completely forgets some parts of the input if it feels like it



It managed "blue apple" on its own, but only gave normal apples for "magenta apple" - I feel like it's a lot more focused when it had something close to it in the original training data (lots of google image hits for blue apples, but not that many for magenta)

Also compare and contrast "very shiny glass rabbit" (lots of google image hits, so it probably exists a few times in the training data)



vs "very shiny glass capybara"



Reformulating the query made it a bit better though - "capybara figure made of glass" gave these, and there is like one glass figurine on google images and it doesn't look like these at all



Bonus: "underwater phantom of the opera"

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

KakerMix posted:

MidJourney is a ton better than dall-e mini, though I like that dall-e mini seems to handle certain things better anyway plus as far as I can tell, no text restrictions. I can't do anything with "Al Gore" because "gore" is filtered >:|

sunrise on mars


Yeah, but dall-e mini is free and open, which is a big plus, also no censoring

I managed to make some porn-like images, but it struggles with the concept of verbs so I haven't managed to make any images with "action" happening (yes I know there is porn on the internet shut up)

Also tried really hard to make any gore photos after you mentioned the censoring, but (thankfully) it seems the training data hasn't had much of that. I managed to make some dead pigs, but they were all fully intact and just lying down, with some slightly bloody spots on the skin.

I did manage to make it cough up these brilliant pieces though:



"human, damien hirst installation"

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
When the AI misunderstands brilliantly



"tower of babel, professional camera"

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
I have a new favourite thing









(most of these are variants of "octahedron of [material] on [place]", most often a coffee table)

edit: now we're getting more bold



ymgve fucked around with this message at 04:37 on Jun 9, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

A Strange Aeon posted:

Can someone do a still from an unreleased live action Calvin and Hobbes movie by a director of your choice?

ok. "live action calvin and hobbes movie by wes anderson" - these were the live action ones, most were just comic book looking




"live action calvin and hobbes movie by david fincher"


"cgi calvin and hobbes movie by disney"


"cgi calvin and hobbes movie by pixar"


"live action calvin and hobbes movie by quentin tarantino"

ymgve fucked around with this message at 05:04 on Jun 9, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock


I cheated, "boy in red shirt with giant tiger [in place]"



boy in red shirt with giant tiger, movie directed by wes anderson

ymgve fucked around with this message at 05:25 on Jun 9, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
I've been trying to make a "just download and run" package for Dall-E Mini - I haven't tested in on a fresh Windows install, but it should in theory only need the Cuda 11.7 toolkit, then just run the .bat file:

https://mega.nz/file/oUty2STD#GYSku4Mm1y5k36vGiXdZcer3VWedj1fyde_2_uJZb2E - package with the small 1.7GB model, might work on older nvidia cards

https://mega.nz/file/FZkHlJga#_91XXGD3RGpihl4iRA3RzBo1UlaHBAB9qMMSCQfCobk - addon with the large 5GB model, definitely requires more GPU RAM

I got a 3080ti and it takes 10 seconds for each picture, so be prepared to have some patience

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

Atoramos posted:

(I do have Python and the Cuda toolkit installed)

You need both packs unzipped in the same place, the mega model download is only the model

ymgve fucked around with this message at 05:34 on Jun 10, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Yeah, the image reconstruction part of DALL-E Mini (vqgan imagenet) has issues with faces. Ironically, the best faces I got was with the phrase "brightly lit scene from a porno movie" (which were not brightly lit at all)

Technical details - the DALL-E Mini algorithm doesn't actually output an image, it outputs a sequence of 16384 numbers which you could call a "summary" of an image, not pixel by pixel but small and large features. Then it is fed into the VQGAN Imagenet network, which transforms those numbers into a 256x256 image.

ymgve fucked around with this message at 06:05 on Jun 10, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

Vlaphor posted:

Speaking of faces...why did this only bring up faces?



try "butthole"

small snake in a snowglobe


samus aran painted by vincent van gogh (still very focused on starry night)


samus aran statue made by jeff koons


donald trump statue made by jeff koons


joe biden statue made by jeff koons


a piece of grilled chicken fillet with ketchup and pasta on a green dinner plate


star wars drawn by boris vallejo

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Some more "X statue made by jeff koons"





edit: some more



ymgve fucked around with this message at 07:08 on Jun 10, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

Atoramos posted:

Edit: of course. Yup, works perfectly.

Great! Could you tell me your hardware and your performance? It takes about 10 seconds to generate an image on a 3080ti here

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
I managed to get Disco Diffusion working locally too, but it's much slower than dall-e mini (takes a few minutes to generate each image) and the results have a certain fever dream "look"

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Yeah, the neural net library seems to be like "grab ALL the GPU RAM" which makes sense for the standard use case where it runs on a linux server with no display

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

An Ounce of Gold posted:

I have been using Disco Diffusion for a week and I am finally starting to crank out some good cartoon style concept art:


Wonderland:



Eternia's jungles from Masters of the Universe:


Caves with underwater lakes and glowing sprites:



I also made an IG account that I am dumping these things to... I've made a lot already. https://www.instagram.com/zooferai/


I don't see how this isn't going to replace like 80% of concept artists and design teams in the next few years.

Are you using the default settings from the workbook, or are you tweaking something? What are the exact prompts?

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

GABA ghoul posted:

Hmm, I quickly tried to set up dall-e mini on my machine and the MEGA and MEGA_full models don't fit into my 6GB card. Is there a quick way to force it to fall back to CPU? Or do I have to actually try to understand what I'm doing to do that?

Open generator.py and place this line right after "import jax":

code:
jax.config.update('jax_platform_name', 'cpu')
But according to others, it takes like 15 to 30 minutes to generate a single image on the CPU, so probably not worth it.

You might have better luck going through the https://colab.research.google.com/github/borisdayma/dalle-mini/blob/main/tools/inference/inference_pipeline.ipynb notebook since that runs on google's servers

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

Justa Dandelion posted:

Do you think it would make a difference if the training sets for agi were presented in a way where the concepts of safety, love, and kindness were heavily reinforced before exposing the ai to the information of the world? Sort of like an ideal ai childhood.

Worth a try, if it fails we can just wipe the AI and start again (but don’t tell the AI that)

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Yeah, Disco Diffusion is pretty slow, partially because the defaults generate much larger images than Dall-E Mini. It also seems to have a much more artistic eye, but much less understanding of objects and language than Dall-E Mini.

Some of the better ones I made:


abandoned mall photo by edward burtynsky


landfill filled with floppy disks, award winning photography by edward burtynsky


spaceship graveyard photo by edward burtynsky

But it often fails in pretty interesting ways:


landfill filled with funko pops, award winning photography by edward burtynsky (it did get the basic idea, but just one giant pop)


a man sitting on the floor trying to fix a computer, painted by Jacques-Louis David, trending on artstation (anatomy? never heard of him)


mewtwo statue made by jeff koons (oh god)

Here's Dall-E Mini's attempts at some of the same prompts: (the landfill is too much for the AI to handle and it just devolves into a noisy background)

ymgve fucked around with this message at 17:14 on Jun 15, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
You should note that the OpenAI terms of service forbids sharing pictures with realistic faces, so be careful with where you share those

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
Also, if anyone has access to Dall-E 2, could you try the prompt "Landfill filled with funko pop figurines, award winning photography by Edward Burtynsky"?

edit: Also you could try some AI prompts that neither midjourney nor dall-e mini nor disco diffusion has gotten yet: "a red panda with blue fur, wildlife photography" or "a blue apple cut in half"

double edit: Thanks!

ymgve fucked around with this message at 17:07 on Jul 1, 2022

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock

External Organs posted:

Just a random thought - I wonder if you'd ever see Adobe license this technology and the results would be exported into separate layers in a .psd.

I don't want or think human artists are going away, but I could see AI-assisted stuff being serious business.

Wouldn't work without a complete retooling of how the AI creates images

Right now it's all a "package deal" - the AI generates the full picture all at once, there are no layers to separate

Adbot
ADBOT LOVES YOU

ymgve
Jan 2, 2004


:dukedog:
Offensive Clock
I think the drug dogs were made by fiddling with «neurons» in the model directly, to reveal what a neuron «sees»

  • 1
  • 2
  • 3
  • 4
  • 5
  • Post
  • Reply