
I Just Wanted to Post an Image Thread

parkdalegardener

Adventurous
By default, Stable Diffusion adds an invisible watermark to the generated images, which is actually a set of metadata tags, nothing on the image itself. It also has self-imposed censorship of R-rated images. However, since this is open source, all of that can be disabled if you know your way around Python. In my local version, I have disabled all of it.

Here's a macro photograph of a very small robot insect, done with Stable Diffusion. :D

Actually, the "print" in the lower left-hand corner is not an invisible watermark. It is most probably from the training data. I would suspect that most "english cottage" images the model is trained on come from public domain woodcut images. A lot of those, when published, have labeling in the lower left. The same training bias is visible using your prompt above. I generated 20 images with it, and 18 of the results show the same orientation of the "insect" as your own, despite there being no orientation information in the prompt. While fun to mess with, I see the man behind the curtain. It takes away some of the fun, as I find myself testing its training to see the bias instead of being amazed by the results. Like all who download the code, I am aware of the NSFW lock in the code and really don't care to disable it. Like I said, I agreed not to make questionable images when I installed the software.
 

Attachments

  • 20220904070910_1884061200.png (436.3 KB)
  • 20220904071007_3160200055.png (278.9 KB)
  • 20220904071105_3889884801.png (326.1 KB)
  • 20220904071514_2650622297.png (323.1 KB)

Ken1171

Esteemed
Contributing Artist
Actually, the "print" in the lower left-hand corner is not an invisible watermark. It is most probably from the training data.

That's not what I meant. The Python script in Stable Diffusion embeds a tag in the image to tell other programs that it was generated by AI. They call it an "invisible watermark," and that's what I was referring to. That can also be removed. The embedded tag even includes the text prompt used to create the image.
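For illustration, one common way front-ends record that kind of tag is as PNG text metadata, roughly like this (just a sketch with Pillow; the key names and prompt are made up, not necessarily what any particular script writes):

```python
# Sketch: writing and reading generation info as PNG text metadata (Pillow).
# The "generator" / "prompt" key names are illustrative, not a fixed standard.
from PIL import Image
from PIL.PngImagePlugin import PngInfo

meta = PngInfo()
meta.add_text("generator", "Stable Diffusion")
meta.add_text("prompt", "macro photography of a tiny robot insect")

img = Image.open("output.png")              # an image generated earlier
img.save("output_tagged.png", pnginfo=meta)

# Reading the tag back:
print(Image.open("output_tagged.png").text)
```

Strip or rewrite that metadata and the tag is gone, which is why it's so easy to remove.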

Like all who download the code, I am aware of the NSFW lock in the code and really don't care to disable it. Like I said, I agreed not to make questionable images when I installed the software.

Sure, just remember the NSFW filter is only for images you will post online. It's there as a liability measure to protect the developers against legal issues related to TOS violations at third-party sites. That's what the "questionable images" part refers to.
 

parkdalegardener

Adventurous
Yeah; I'm not worried about that. I find it a fun thing to play with, as I like to figure out the training bias. Seed values, for instance, appear to be a predictor of output style. Image size also seems to reveal the training data, as does image orientation. It's really quite fast on my card, so I can kick out fast iterations of an expression. Punctuation is quite interesting when it comes to fine-tuning an image, though capitalization appears not to matter. There appears to be no difference between "Cat" and "cat" in an expression. Below are all from the exact same words and seed, with punctuation being the only tuning.
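If anyone wants to try the same experiment, here's roughly what it looks like with the diffusers library (just a sketch; the model ID, prompt, and seed are placeholders, not my exact setup):

```python
# Sketch: same prompt words and same seed, punctuation as the only variable.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4", torch_dtype=torch.float16
).to("cuda")

seed = 600
prompts = [
    "a cat sitting on a stone wall english cottage garden",
    "a cat, sitting on a stone wall, english cottage garden",
    "a cat. sitting on a stone wall. english cottage garden.",
]

for i, prompt in enumerate(prompts):
    generator = torch.Generator("cuda").manual_seed(seed)  # fixed seed each run
    image = pipe(prompt, generator=generator).images[0]
    image.save(f"punctuation_test_{i}.png")
```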
 

Attachments

  • 20220904162916_600.png (469.3 KB)
  • 20220904162951_600.png (460.7 KB)
  • 20220904163057_600.png (487 KB)
  • 20220904163547_600.png (479.2 KB)
  • 20220904163812_600.png (454.8 KB)

Ken1171

Esteemed
Contributing Artist
Punctuation is quite interesting when it comes to fine-tuning an image, though capitalization appears not to matter.

That is correct. Capitalization is ignored, but punctuation affects the results because it can change the meaning of a sentence in natural language interpretation. There are several examples of sentences where just moving a comma changes the meaning significantly. The order of words is also important; words can get more weight depending on where they are in the sentence. Those are good things to know when typing prompts. :)
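You can actually see the capitalization part in the text encoder's tokenizer (a sketch assuming the transformers package and the CLIP tokenizer checkpoint that SD 1.x is built on):

```python
# Sketch: the CLIP tokenizer lowercases text, so "Cat" and "cat" tokenize identically.
from transformers import CLIPTokenizer

tok = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
print(tok.tokenize("A Cat, sitting on a wall"))
print(tok.tokenize("a cat, sitting on a wall"))  # same token sequence
```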
 

robert952

Brilliant
I'll stop flooding the thread with these though. When I grok the training bias enough I'll post something interesting.
May I suggest a separate thread for this? When Mandelbulb3D and Apophysis first came out I didn't have the computer power to really run them. So, I didn't follow their development too much. But the concept intrigued me. That's part of why I got into 3D stuff when I came across DAZ Studio (some time just before V2.0).

I'd follow a thread on this. Consider the other programs that generate landscapes and bring in other 3D models (Flowscape for example).

Speaking of CGI (and a bit OT), I just watched the first two episodes of The Rings of Power on Prime. Great visual experience.
 

parkdalegardener

Adventurous
robert, my interest in this may be temporary as I am currently trapped here recovering from a car accident. It passes the time and is easier for me right now than Poser, which is my usual poison of choice; as my movement is drastically reduced for the foreseeable future.

As for Rings of Power; I'm afraid I haven't seen it and am not likely to. Retired for years on a small fixed income means no Prime, no Amazon deliveries either for that matter. My friends in "The Industry" are somewhat unhappy at the amount of effects and the pace they were produced at for RoP but are happier at the outsourced pay rate than those that worked on House of the Dragon. I am also told that the effects work on HotD was sent in at a quality many are unhappy with due to the schedule but that is all hearsay till I see for myself.
 

Hornet3d

Wise
robert, my interest in this may be temporary as I am currently trapped here recovering from a car accident. It passes the time and is easier for me right now than Poser, which is my usual poison of choice; as my movement is drastically reduced for the foreseeable future.

As for Rings of Power; I'm afraid I haven't seen it and am not likely to. Retired for years on a small fixed income means no Prime, no Amazon deliveries either for that matter. My friends in "The Industry" are somewhat unhappy at the amount of effects and the pace they were produced at for RoP but are happier at the outsourced pay rate than those that worked on House of the Dragon. I am also told that the effects work on HotD was sent in at a quality many are unhappy with due to the schedule but that is all hearsay till I see for myself.
Sorry to hear about your car accident. I hope your recovery is a full one, achieved in double-quick time.
 

Miss B

Drawing Life 1 Pixel at a Time
CV-BEE
Sending good wishes your way, PDG! I also hope you recover sooner rather than later.
 

Pendraia

Sage
Contributing Artist
robert, my interest in this may be temporary as I am currently trapped here recovering from a car accident. It passes the time and is easier for me right now than Poser, which is my usual poison of choice; as my movement is drastically reduced for the foreseeable future.

As for Rings of Power; I'm afraid I haven't seen it and am not likely to. Retired for years on a small fixed income means no Prime, no Amazon deliveries either for that matter. My friends in "The Industry" are somewhat unhappy at the amount of effects and the pace they were produced at for RoP but are happier at the outsourced pay rate than those that worked on House of the Dragon. I am also told that the effects work on HotD was sent in at a quality many are unhappy with due to the schedule but that is all hearsay till I see for myself.

Hope you're feeling better soon.
 

parkdalegardener

Adventurous
Thank you Kera. They had to replace rods again.

So, with that being said: yes, I have rods in my back. The folks over on PFD, like Kerya, have known that for years. It got me thinking and gave me a prompt to work with. Using the fact that SD is trained mostly on square images, a little twist or two will get a double result on rectangular images. It also gives heads growing out of heads, but that was another experiment. Think of this as a little pdg self-portrait remembrance of before and after.
 

Attachments

  • 20220906141533_2177880885.png (453.8 KB)

Ken1171

Esteemed
Contributing Artist
Using the fact that SD is trained mostly on square images, a little twist or two will get a double result on rectangular images.

I have noticed this too. When I use portrait or landscape aspect ratios, there is a good chance SD will duplicate the subject or, worse, parts of the subject, like 2 torsos or 2 heads. If all the training images were square, it doesn't know how to handle the extended space because it has never seen it. However, I think all AIs were trained with square images, so if other AIs don't duplicate things, then there may be a way around this.
 

parkdalegardener

Adventurous
I have noticed this too. When I use portrait or landscape aspect ratios, there is a good chance SD will duplicate the subject or, worse, parts of the subject, like 2 torsos or 2 heads. If all the training images were square, it doesn't know how to handle the extended space because it has never seen it. However, I think all AIs were trained with square images, so if other AIs don't duplicate things, then there may be a way around this.
I've trained numerous models for specific recognition tasks. The training images do not have to be of a certain shape or size, but they are converted to a similar size of the trainer's choosing during training. In short, an image is converted to greyscale, resized to a standard training size, hashed, manipulated, and hashed again. The last two steps are repeated as often (training epochs) as the trainer requires. These results build the database.

SD goes in the opposite direction. It builds the requested image by turning the textual info into a hash and using that hash to refine noise into an image containing all the hash info requested. Its method couldn't care less what the output size is; it will create image data to fill whatever dimensions the user asks for. That is where the rub lies: step two of training, converting images to a standard size for manipulation. SD treats its output as a square to match the training data, which is why prompt info gets duplicated in the output. SD just tries to match the hash data in the query to the hash data in its database (the SD model).
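For what it's worth, the "convert to a standard size" step I'm describing is basically this kind of thing (a sketch with Pillow, assuming a 512x512 target like the SD 1.x models use):

```python
# Sketch: normalize an arbitrary training image to a standard square size.
from PIL import Image

def to_training_size(path, size=512):
    img = Image.open(path).convert("RGB")
    w, h = img.size
    scale = size / min(w, h)                  # scale the short side to `size`
    img = img.resize((round(w * scale), round(h * scale)), Image.LANCZOS)
    w, h = img.size
    left, top = (w - size) // 2, (h - size) // 2
    return img.crop((left, top, left + size, top + size))  # center-crop to square
```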

SD could be used to build out around an image, creating a partial wing on a butterfly, for instance, where the original photograph of the insect on a flower had part of a wing cut off.

We are getting way into the weeds on this one for a thread that is basically a showoff thread. I agree with robert. We should probably start a different thread about this stuff.
 

Ken1171

Esteemed
Contributing Artist
The documentation claims SD requires dimensions to be multiples of 64, so it doesn't have to be square, but it also can't be any arbitrary size. Fair enough. The dimensions are also capped by how much VRAM we have, but we can upscale afterwards. However, if I want to make a 2K image, that is a problem because 1440 is not a multiple of 64. Maybe this can be compensated for with outpainting, which is yet another cool thing SD can do.
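A quick helper shows the arithmetic (just a sketch):

```python
# Sketch: snap a requested dimension to a multiple of 64.
def snap_to_64(n, up=False):
    return ((n + 63) // 64) * 64 if up else (n // 64) * 64

print(snap_to_64(1440))           # 1408 (nearest multiple below 1440)
print(snap_to_64(1440, up=True))  # 1472 (nearest multiple above 1440)
```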
 

parkdalegardener

Adventurous
Documentation? RTFM? You have to be kidding. I barely admit to being able to read at all. I'm the first person in my family that even finished high school.

I picked an arbitrary image ratio to work with: 704x384 or 512x512. I have no idea how much VRAM is on the system. I know I'm supposed to need a 30-series card, and this machine is a 2070. I have another box with a 3080 and twice the system RAM this machine has, but that machine isn't for development and messing around.

As mentioned above in a late response to RAMWolff, I use Video2X for upscaling. It's open source and predominantly aimed at upscaling anime and Ghibli-style video, but it also includes the Vulkan RealSR AI model alongside the others, so it does stills in moments. It will use all the GPU cycles you throw at it for the encoding, but it is still a Win32 app. Again, I use all this stuff on "under spec" equipment and have yet to even have my fans turn on.

Whoops, gotta post an image. That is the purpose of the thread... There we go.
 

Attachments

  • 20220906135945_2720086599.png (460.9 KB)
  • 20220906140807_2884479082.png (427.7 KB)

Ken1171

Esteemed
Contributing Artist
I picked an arbitrary image ratio to work with. 704x384 or 512x512.

Wow, all 4 dimensions are precise multiples of 64, and you said you picked them at random? You are very lucky! ^____^ I have found a WebUI version that automatically picks multiples of 64 when setting the dimensions, so I can drop the calculator when using it.

Looks like they have a new model, version 1.5, cooking on the web version. It's still in beta, and depending on how testing goes, they might release it to the public in a week or two. It is meant to improve faces and hands, but in its current state it's still pretty bad. Just the stuff we were mentioning above. As expected, they are working on it, and I am surprised by how fast it's developing. A lot happens in just a week.

Alright, let's post a picture. I like this little bugger. It's macro photography again. With the current model 1.4, SD still has trouble counting limbs and working out where they attach.

RobotInsects_06.jpg
 