Image models reward specificity. “A cat” gives you a generic cat; “a ginger tabby on a rain-streaked windowsill, soft overcast light, 35mm, shallow depth of field, melancholic mood” gives you a photograph. This tool takes your rough idea and, using your own OpenAI or Anthropic key, rewrites it into three polished prompt variants tuned for your target generator.
How it works
Choose a provider and model, paste your API key, type your rough prompt, and select the target generator — Midjourney, DALL-E, or Stable Diffusion. The tool sends one direct request from your browser asking the model to enrich your prompt with concrete cues for style, lighting, composition, lens, and mood, and to format the output for the chosen generator (parameter flags for Midjourney, tag lists for Stable Diffusion, natural language for DALL-E). It returns three distinct variants so you can pick a direction.
Your key never reaches a Gera server — it is held only in the tab and sent straight to the provider (with the official direct-browser-access header for Anthropic). Refreshing clears it.
What gets added
- Style — medium, art movement, or reference aesthetic.
- Lighting & mood — direction, quality, colour, atmosphere.
- Composition & lens — framing, angle, focal length, depth of field.
Tips
- Keep your input prompt focused on the subject; let the refiner handle the descriptive scaffolding.
- Generate variants, render each, then merge the best phrasing into a final prompt.
- Cheaper models (gpt-4o-mini, claude-3-5-haiku) are plenty for prompt rewriting and keep cost negligible.