18 - 18 gloriously bizarre failures. <the bats go wild> ah ha ha ha ha ha <despairing head bang>

546
3
  • Pilindë Pebrimbor's avatar Artist
    Pilindë Pe...
  • DDG Model
    ProVideo
  • Access
    Public
  • Created
    1yr ago

Prompt

Action: 1. The shiny metallic silver-haired woman is wearing a button-up white blouse with the collar button undone and a tan high-waisted, pleated mid-thigh length mini-skirt adorned with decorative thin red and cream horizontal stripes just above the hemline, showcasing her shapely bare legs, set off by black high-heeled ankle boots. 2. She quickly turns slightly toward the left of the view to adjust her course to walk briskly along the straight wall running toward the left side of the view, moving away from the dresser until it disappears off the right edge of the view behind her. The camera remains centered on a medium shot to frame her from just below the hem of her skirt to her head throughout this movement. 3. Next she comes to a door and quickly opens it, then turns to face the camera, revealing behind her a bedroom through the opening, with decor, walls, floor, and furnishings in styles and colors matching the area she is in. The camera zooms out to frame her in a head-to-toe longshot. Furnishings in the bedroom revealed behind her include a bed, a vanity with a mirror, a chair in front of the vanity, and a large armoire. lighting: Soft, natural light from the window. Style: Realistic, observational style. Avoid dramatic or over-the-top camera movements. Notes: The woman's movements should be smooth and natural, as if this is all a part of her daily routine. Her expression is happy and relaxed.

More about 18 - 18 gloriously bizarre failures. <the bats go wild> ah ha ha ha ha ha <despairing head bang>

*Almost*; *nearly*; HEART-BREAKINGLY close to right. This model has some very strange ideas about how doors work, and just won't listen if you try to fix them. I made two very subtle changes to the prompt attempting to stop it hanging two doors in one frame and doing bizarre, magical things with the door opening both ways and sliding across the header or whatever it thought it was doing in those clips. Also added 'multiple door' to my negative prompt.

That did at least accomplish something, though I also believe it's entirely possible that my editing the prompt to make sure the word 'door' only appears in it once had nothing to do with this slight improvement. Simply random variation, which happened to break slightly in my favor. But... there's still two doors in that frame. They're just both swung the same way, and one of them appears like magic out of the wall. I'm charitably ignoring the turn in the wall the prompt said should be straight.

My theory based only on observation of what we can actually see happening when we click buttons with this system, since there's no documentation or explanations of even the basics of how it works, is that the AI model is not, in fact, given the prompt we write directly. There's a parser or pre-processor or whatever the engineers call it in this particular implementation which goes through the prompt and (poorly) analyzes what we wrote, putting it into a defined format the model was trained to understand.

I base that claim on spending a fair amount of time (and compute energy) hitting the "enhance prompt" button. A label, I note, that probably violates the consumer protection laws of multiple jurisdictions, it should more properly be labelled "distort prompt beyond recognition" or maybe "massacre prompt" would be more succinct. My guess (since there's no docs) is that button simply runs the prompt we earnestly labor over trying to get the AI to produce something vaguely resembling our mind's eye image of what we want through the blender of the preprocessor it's going to run before submitting it to the the LLM model.

Though that's an insult to a blender, which at least leaves all the chopped up bits whirling around together even after destroying their integrity and relation to each other. Without exception, every time I run 'enhance prompt', important information is missing from the result. LOTS of it. If I'm even close to right about what's happening, the system is more or less counting on the 'ooo, ahhh' factor of beautiful results for users to go, "well, not what I wanted, but really pretty - might as well make it public and get a few likes". Not a lot of artistic integrity or fidelity to our vision rather than randomness in that process, but I suppose it has some quite valid uses. Just not what at least one paying customer is looking for. <sigh>

Comments


Loading Dream Comments...