AI image generation is a rapidly evolving field that offers a novel way to produce images. This rule discusses different AI image generators and what we can use AI images for.
Prompts are the instructions that you input. They can be as simple or as complex as you like. A general prompt might look like "generate an image of a sunset over the ocean," which tells the AI exactly what you're looking for. A well-structured prompt often has the format “A {{ TYPE OF PICTURE }} of a {{ MAIN SUBJECT }}, {{ STYLE CUES }}”.
“A brown dog on a skateboard”
✅ Figure: Good example - A basic prompt
You can add more detail to make a more effective prompt by following this template:
{{ ADJECTIVE }}, {{ EMOTION }}, {{ SUBJECT }}, {{ STYLE }}, {{ COLOR }}
Negative prompting is specifying what you don't want in your image. It can be an effective way of guiding the AI away from certain features that you're not interested in. Some AI image generators (e.g. Midjourney and Dreamstudio) have this option. In others (e.g. DALLE-2), you can include it in your prompt.
“An open highway --no cars”
✅ Figure: Good example - A prompt with a negative element in Midjourney format
Parameters allow you to control different aspects of the generated image via settings on the image generator. Most AI image generators have parameter options, and they can significantly affect the result.
"A scene"
❌ Figure: Bad example - A vague prompt like this gives an ambiguous image
"A snowy mountain landscape at sunset... majestic peaks adorned with glistening snow, bathed in warm hues, creating an ethereal and serene atmosphere. The scene evokes awe as untouched slopes and frozen trees blend with the fading light, leaving an indelible impression of nature's grandeur. Two travelers cross a footbridge over a small creek in the foreground."
❌ Figure: Bad example - This prompt is too long
"A snowy mountain landscape at sunset with warm hues"
✅ Figure: Good example - A detailed description will provide the AI with specific elements to incorporate, resulting in a more accurate image
As of now, the top contenders are DALL-E 2, Midjourney, and DreamStudio. Each of these has features that make them stand out.
DALL-E is an AI system capable of creating realistic images from a natural language description.
Figure: "A red tree in a valley. Hi res" - by DALL-E2
Midjourney is used on Discord, where users interact with the bot by typing /imagine.
Note: A Discord account is required first.
Figure: "A red tree in a valley. Hi res" - by Midjourney
DreamStudio is made by StabilityAI and is used, like DALLE2, on a web interface. It is based on the Stable Diffusion model of image generation.
You can use the demo here for free Stable Diffusion Web, or you can use it through the DreamStudio interface (starting with a free trial).
Figure: "A red tree in a valley. Hi res" - by DreamStudio