I get that. But it seems like fishing for something rather than having any control over the creation. Spin the wheel, take your chances. Soon, no one will bother to think.
You can see what makes up a good prompt that way and mix and match. You can take point 5 and put in "midday" instead of "golden hour", or take "candid photograph" and add it to prompt 5. Every comma-delimited item in the ChatGPT response list is a building block for a new variant the AI can spit out. If you don't know how to quickly pull together a complex prompt for your purpose, you can enlist another language model to help you find the right sequence of words and inspire you. You might see "performance photography" and realize that yes, this is what you actually had in mind.
If you have a fast machine - say an RTX 4090 with a fast CPU - you can crank out Stable Diffusion results in seconds and iterate by playing around with the words. Iteration combined with speed is a powerful way to get to great results very fast.
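To make the mix-and-match idea concrete, here is a rough sketch of how you could generate every combination of a few building blocks and feed them to whatever generator you use. The subject line, the slot names and the `prompt_variants` helper are made up for illustration; only the quoted phrases come from the examples above.

```python
from itertools import product


def prompt_variants(subject, slots):
    """Yield one comma-delimited prompt per combination of slot options."""
    for combo in product(*slots.values()):
        yield ", ".join([subject, *combo])


# Hypothetical subject and slots; swap in the pieces from your own prompt list.
slots = {
    "style": ["candid photograph", "performance photography"],
    "lighting": ["golden hour", "midday"],
}

variants = list(prompt_variants("dancer on a rooftop", slots))
for variant in variants:
    print(variant)
```

Two styles times two lighting options gives four prompts to try. Add a third slot and the list grows quickly, which is where a fast GPU pays off.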
I think you have a lot of control and it is super easy with not much "talent" or "skill" needed.
Next-gen AI will also have more and more advanced control features, meaning you can add sample imagery or other style constraints on top to steer the result. Firefly already lets you say whether you want photorealistic output or not, for example.
With advances in computing power and more sophisticated models trained on higher-res imagery, we should soon have access to native 4K generative AI. A good 4K base image can be retouched in Photoshop or upscaled to create fine art you can print and hang on a wall.
AI models are trained on millions of images - the possibilities to get what you want are vast and it is not too difficult to get the hang of it.
Everyone will be a fine art artist in no time!