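Running Z Image Turbo in ComfyUI, plus a prompt guide
Note that Z Image Turbo is a distilled model (DMD), not a base model.
Among lightweight models it has by far the best baseline performance. If you plan to fine-tune a model newer than SDXL, the Z Image base model is probably the only real choice.
It also runs on forge-neo, which is Automatic1111-compatible.
Qwen Image Edit and FLUX.2 handle both editing and generation in one model, but Z Image does not.
Models for LoRA training
- Z Image Omni Base: pre-trained only, for both generation and editing. It has had neither fine-tuning nor reinforcement learning, so image quality is poor. Intended for large-scale additional training
- Z Image: fine-tuned, but without reinforcement learning or RLHF
- Z Image Edit: has had the shared pre-training, Edit-specific pre-training, and Edit-specific fine-tuning. It is not distilled, so it needs a fair number of steps. According to the paper, 50 steps with CFG
Models for generation
- Z Image Turbo: the full-spec version with fine-tuning, distillation, reinforcement learning, and RLHF applied. Z Image uses DMDR, which performs distillation and reinforcement learning at the same time
Z-Image vs. Turbo
Turbo is strong at realism in photographic images. Changing the seed does not produce much variety, but it rarely produces broken images.
Z-Image (the non-Turbo model) can express a wide range of anime and illustration styles. Its photographic quality is also fairly high, and the seed gives real variety. However, inference is slow (roughly 5x slower than Turbo).
For a comparison of Turbo and Z-Image, see Z-Image Base VS Z-Image Turbo.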
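Features
As an official model, it naturally does not recognize NSFW content, characters, or artists.
- By far the best baseline performance among lightweight models
- Hands rarely melt
- Backgrounds rarely break down
- Follows prompt instructions faithfully in English and Chinese
- High image quality
- No Detailer needed except for very small background objects and the like
- Relatively lightweight at 6B parameters. Other models for comparison:
  - 2.6B Lumina-Image 2.0
  - 2.6B SDXL (U-Net only)
  - 8B SD3
  - 20B Qwen Image Edit
  - 32B FLUX.2
- Can output at 2k
  - Training resolution was 512 to 1,536
  - An RTX3050 can generate a 2,048 x 1,536 image in 150 seconds
- Nude output is possible to some extent
- Can render text in English and Chinese
- Japanese can be used in prompts
- Good at backgrounds
Drawbacks
- Prompts
  - Prompts are more accurate when written in English or Chinese. The text encoder's understanding of Japanese is weak
  - English and Chinese text rendering is strong, but when drawing manga the text has many mistakes
  - Japanese text rendering is only partially supported
- NSFW interactions between characters are not possible
- Does not recognize characters or artists
- Above 1,536 pixels, the part of the image beyond that size becomes unstable. Above 2k, the part beyond it almost always breaks down
  - An upscale workflow can handle up to about 4MP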
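Required specs
Model (12 GB) + text encoder (8 GB) + VAE (0.3 GB) comes to about 20 GB, so 32 GB of RAM is enough.
With recent versions of ComfyUI, the amount of VRAM matters much less. ComfyUI now loads the model into RAM and transfers only the parts it needs to VRAM for processing. Image generation is compute-bound, so running inference while streaming the model from RAM barely reduces generation speed.
The ComfyUI version used here is 0.3.75.
VRAM usage by image size
512 x 512 = 0.1 GB.
| Resolution | VRAM usage (GB) |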
|---|---|
| 512 x 512 | 0.1 |
| 1024 x 1024 | 0.4 |
| 1024 x 1536 | 0.6 |
| 1536 x 1536 | 0.8 |
| 1536 x 2048 | 1.2 |
| 2048 x 2048 | 1.6 |
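The table is close to a linear scaling of the 512 x 512 figure with pixel count. The sketch below is only that rule of thumb, not a measurement: the function name `estimate_image_vram_gb` is hypothetical, and the linear model is an assumption read off the table. It gives 0.9 GB for 1536 x 1536, where the table reports 0.8 GB.

```python
def estimate_image_vram_gb(width: int, height: int, gb_per_512_tile: float = 0.1) -> float:
    """Scale the 512 x 512 figure (0.1 GB) linearly with the number of pixels.

    Assumption: usage grows in proportion to pixel count, as the table suggests.
    """
    return gb_per_512_tile * (width * height) / (512 * 512)


if __name__ == "__main__":
    for w, h in [(1024, 1024), (1024, 1536), (1536, 2048), (2048, 2048)]:
        print(f"{w} x {h}: about {estimate_image_vram_gb(w, h):.1f} GB")
```

Treat the result as a quick sanity check when picking a resolution, not as a guarantee for your setup.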
Models
Z-Image Turbo
| Location | Model |
|---|---|
| models/unet | z_image_turbo_bf16.safetensors |
| models/text_encoders | qwen_3_4b.safetensors |
| models/vae | ae.safetensors |
For the workflow, drag the image from ComfyUI_examples into ComfyUI.
Other models
| Model | URL |
|---|---|
| AIO | SeeSee21/Z-Image-Turbo-AIO |
| Quantized for 8 GB VRAM (fp8 and INT8) | lightx2v/Z-Image-Turbo-Quantized |
| GGUF text encoder | Qwen/Qwen3-4B-GGUF |
| VAE that enhances detail for photorealistic models | Owen777/UltraFlux-v1 |
| VAE that reduces noise, aimed at anime | Anzhc/Z-Image_Anime_VAE |
| Model with extra distillation for 2-step generation | inclusionAI/TwinFlow-Z-Image-Turbo |
On RTX 4000 series or later, fp8_scaled is recommended because it speeds up inference by 50% or more. If RAM is limited, GGUF or DFloat11 are options.
On RTX 5000 series or later you can use nunchaku's nvfp4 (fp4), which roughly doubles inference speed. However, LoRA cannot be used with it.
DFloat11 losslessly compresses bf16 and cuts the model's memory usage by 30%, which lets the whole Z Image model fit in 12 GB of VRAM. However, DFloat11 is likely to reduce generation speed.
| Quantization | URL |
|---|---|
| GGUF | jayn7/Z-Image-Turbo-GGUF |
| fp8_scaled | Kijai/Z-Image_comfy_fp8_scaled |
| fp8_scaled | Z-image Turbo [TensorCoreFP8] |
| DFloat11 | z_image_turbo_bf16-DF11.safetensors |
| nunchaku official int4 | svdq-int4_r256-z-image-turbo.safetensors |
| nunchaku official nvfp4 (fp4) | svdq-fp4_r128-z-image-turbo.safetensors |
For a nunchaku quality comparison, see Z-Image Turbo BF16, NVFP4, Nunchaku Basic Comparison.
Z-Image
| Model | URL |
|---|---|
| Official | Tongyi-MAI/Z-Image |
| GGUF | jayn7/Z-Image-GGUF |
| ComfyOrg bf16 | z_image_bf16.safetensors |
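LoRA
LoRA training uses the base model (Tongyi-MAI/Z-Image).
ai-toolkit and musubi-tuner support it.
For fine-tuning with musubi-tuner on 8 GB of VRAM, see "VRAM8GB で Z Image の LoRA作成" (creating a Z Image LoRA with 8 GB of VRAM).
Distillation LoRA
Applying this LoRA after your own LoRA enables low-step generation.
DiffSynth-Studio/Z-Image-Turbo-DistillPatch
I2L
Training a LoRA on Turbo
ai-toolkit supports this. It requires ostris/zimage_turbo_training_adapter. v2 increases the LoRA rank and training time, and is the recommended version.
ai-toolkit implements block swap, so training is possible even with low VRAM (though you still want at least 12 GB).
What does ostris/zimage_turbo_training_adapter do?
It uses differential training to extract only the concept.
zimage_turbo_training_adapter is simply a LoRA trained on images generated by Z Image Turbo. Because it was trained in the ordinary way, it has lost the distillation behavior. If you train your own LoRA on top of a model with this adapter applied, your LoRA learns only the concept.
Applying that concept-only LoRA to Z Image Turbo lets you use the LoRA while keeping the distillation behavior intact.
However, Z Image Turbo was produced with "reinforcement learning during distillation, plus RLHF after distillation", so the more additional training you do, the more image quality degrades.
Training in the cloud
In the cloud a LoRA costs a few hundred yen per run (the minimum top-up is $10 on Runpod and $5 on vast.ai), so unless you are mass-producing LoRAs there is no need to upgrade your GPU. Cloud GPUs have more VRAM, so you can train on higher-resolution images and get better quality.
For how to train with ai-toolkit in the cloud, see the video below.
de turbo
ostris/Z-Image-De-Turbo was trained in the ordinary way on Turbo-generated images to remove the distillation. Until Z Image Base was released, this was the model to build LoRAs on.
Because its training data is biased, it performs worse than the official Base.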
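Execution time
With CFG 5 and 20 steps, the non-Turbo Z-Image is 5x slower than Turbo.
Common settings
- Steps: 8 (the official recommendation is 9)
- Sampler: euler
- Scheduler: simple
- CFG: 1
Summary of inference speed (s/it)
SageAttention alone gives roughly a 15-25% speedup; adding TorchCompile brings it to around 30%. Note that the runs also differ in quantization (bf16 vs. Q8_0) and library versions, so the comparison is not strictly like-for-like.
| Resolution | No optimization | SageAttention | SageAttention + TorchCompile |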
|---|---|---|---|
| 1,024 x 1,024 | 5.2 | 4 | 3.7 |
| 1,536 x 1,024 | 7.2 | 5.7 | 5.3 |
| 1,536 x 1,536 | 10.7 | 9.0 | 8.0 |
| 2,048 x 1,536 | 16.1 | 11.8 | 11.2 |
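The percentages quoted for this table can be recomputed from the s/it figures. The sketch below does that, defining the saving as the fraction of per-step time removed relative to the unoptimized run. That definition is an assumption about how the percentages were derived, and `reduction_pct` is a hypothetical helper name.

```python
# s/it figures copied from the summary table:
# (no optimization, SageAttention, SageAttention + TorchCompile)
timings = {
    "1,024 x 1,024": (5.2, 4.0, 3.7),
    "1,536 x 1,024": (7.2, 5.7, 5.3),
    "1,536 x 1,536": (10.7, 9.0, 8.0),
    "2,048 x 1,536": (16.1, 11.8, 11.2),
}


def reduction_pct(baseline: float, optimized: float) -> float:
    """Percentage of per-step time saved relative to the baseline."""
    return (baseline - optimized) / baseline * 100


if __name__ == "__main__":
    for resolution, (base, sage, sage_tc) in timings.items():
        print(
            f"{resolution}: SageAttention {reduction_pct(base, sage):.0f}%, "
            f"+ TorchCompile {reduction_pct(base, sage_tc):.0f}%"
        )
```

Multiplying a row's s/it by the step count gives a rough total sampling time for that resolution.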
No optimization
Environment
- Windows 11 24H2
- ComfyUI 0.3.75
- RTX3050 8GB
- RAM 32GB
- python 3.11.6
- torch 2.8.0+cu126
- Model and text encoder precision: bf16
Execution time
| Resolution | Inference speed (s/it) | Peak RAM usage (GB) |
|---|---|---|
| 1,024 x 1,024 | 5.2 | 30 |
| 1,536 x 1,024 | 7.2 | 30 |
| 1,536 x 1,536 | 10.7 | 30 |
| 2,048 x 1,536 | 16.1 | 30 |
SageAttention
For installation, see "RTX3050+ComfyUI で SageAttention の導入" (installing SageAttention on RTX3050 + ComfyUI).
SageAttention alone speeds up inference by 15-25%.
Environment
- Windows 11 24H2
- ComfyUI v0.6.0-3-g532e2850 | Released on '2025-12-24'
- RTX3050 8GB
- RAM 32GB
- python 3.12.9
- torch 2.9.1+cu128
- triton_windows-3.5.1.post23
- sageattention-2.2.0+cu128torch2.9.0.post3
- Model and text encoder quantization: Q8_0
Execution time
| Resolution | Inference speed (s/it) | Peak RAM usage (GB) |
|---|---|---|
| 1,024 x 1,024 | 4 | 24 |
| 1,536 x 1,024 | 5.7 | 24 |
| 1,536 x 1,536 | 9.0 | 24 |
| 2,048 x 1,536 | 11.8 | 24 |
SageAttention + TorchCompile
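The environment is the same as for SageAttention, with the TorchCompileModel node run after loading the model. The inductor and cudagraphs backends gave the same inference speed, so inductor was used.
Roughly 5-11% faster than SageAttention alone, and about 30% faster than with no optimization.
Execution time
| Resolution | Inference speed (s/it) | Peak RAM usage (GB) |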
|---|---|---|
| 1,024 x 1,024 | 3.7 | 24 |
| 1,536 x 1,024 | 5.3 | 24 |
| 1,536 x 1,536 | 8.0 | 24 |
| 2,048 x 1,536 | 11.2 | 24 |
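Fast generation
- Use an RTX 4000 series or later GPU with an fp8_scaled model
- Install SageAttention
- Use an NVFP4-quantized model. See "ComfyUI now supports (some) NVFP4 models"
- Generate in 4 steps with the res_multistep sampler
4-step generation is noticeably soft in fine detail, but it is useful for testing prompts. With the measures above, an RTX 5090 generates a 2k image in a few seconds.
Controlnet
Place Z-Image-Turbo-Fun-Controlnet-Union-2.1-2601-8steps.safetensors in models/model_patches.
Load the Controlnet with the ModelPatchLoader node and apply it with the ZImageFunControlnet or QwenImageDiffsynthControlnet node. The Get Image Size node is handy for matching the resolution of the Controlnet image and the generated image.
Installing the OpenposePreprocessor or CannyEdgePreprocessor node lets you generate Controlnet input images from an image.
Improving Controlnet image quality
The only way to make the Controlnet take strong effect is to set Strength to 1, but that degrades image quality. The following workarounds help.
- Use KSamplerAdvanced so that the first 5 steps use the Controlnet and the remaining 4 steps do not
- Generate with the Controlnet in around 4 steps, then run i2i without the Controlnet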
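The first workaround amounts to chaining two KSamplerAdvanced nodes that share one step schedule. The sketch below only computes the settings for each stage. The keys follow the KSamplerAdvanced widget names as I understand them (`add_noise`, `steps`, `start_at_step`, `end_at_step`, `return_with_leftover_noise`), so check them against your ComfyUI version. `split_steps` and the `model` labels are hypothetical.

```python
def split_steps(total_steps: int, controlled_steps: int) -> list[dict]:
    """Settings for two chained samplers: Controlnet first, plain model after."""
    if not 0 < controlled_steps < total_steps:
        raise ValueError("controlled_steps must be between 1 and total_steps - 1")
    first = {
        "model": "with_controlnet",  # label only: the Controlnet-patched model
        "add_noise": "enable",  # stage 1 starts from fresh noise
        "steps": total_steps,
        "start_at_step": 0,
        "end_at_step": controlled_steps,
        "return_with_leftover_noise": "enable",  # hand a still-noisy latent to stage 2
    }
    second = {
        "model": "without_controlnet",  # label only: the unpatched model
        "add_noise": "disable",  # continue from stage 1's latent
        "steps": total_steps,
        "start_at_step": controlled_steps,
        "end_at_step": total_steps,
        "return_with_leftover_noise": "disable",
    }
    return [first, second]


if __name__ == "__main__":
    for stage in split_steps(total_steps=9, controlled_steps=5):
        print(stage)
```

Both stages keep the same `steps` value so that they walk the same schedule. Only the start and end indices differ.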
External links
- alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union-2.1
- alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union-2.0
- alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union GitHub
Prompt guide
Avoid negations
Put things you do not want drawn into the negative prompt. This doubles inference time, though. Also note that distilled models such as Turbo do not work well with negative prompts.
Ways to use negation
- Enter a negative prompt and set CFG above 1
  - The CFGNorm node or the APG guidance node can prevent oversaturation
- Use NAG (see "NAG (Normalized Attention Guidance) works on Z-Image Turbo now.")
- Use NegPiP
Avoid vague adjectives
✖: beautiful girl
〇: A girl with flowing silver hair that catches the moonlight
Items worth describing
| Item | Example |
|---|---|
| Spatial relationship | standing at the top of stairs, sitting under the tree |
| Gaze | looking down at viewer, gazing upward at the sky |
| Emotion | with a confident smile, mysterious expression |
| Atmosphere | in a dreamlike atmosphere, surrounded by magical sparkles |
| Texture | silk-like hair, crystalline dress |
| Motion | hair swaying in the breeze, petals falling around her |
Use structure
Structuring the prompt with markdown lowers the chance that parts of it are ignored.
Bad example
She says "セリフ1" in a speech bubble in the left top. She says "セリフ2" in a speech bubble in the left middle. She says "セリフ3" in a speech bubble in the left bottom.
Good example
# speech bubble
There are three speech bubbles.
- speech bubble 1 (left top): "セリフ1"
- speech bubble 2 (left middle): "セリフ2"
- speech bubble 3 (left bottom): "セリフ3"
Other prompt examples
# person There are three persons. - person 1 (left): a girl, short red hair, blue eyes, sitting on a stool - person 2 (middle): a boy, short brown hair, grey eyes, standing, v sign - person 3 (right): a girl, long silver hair, red eyes, sitting on a sofa, holding a card with the word "test" written on it.
# person There are five persons. From left to right - person 1: a girl, short red hair, blue eyes, sitting on a stool - person 2: a boy, short brown hair, grey eyes, standing, v sign - person 3: a girl, long silver hair, red eyes, sitting on a sofa, holding a card with the word "test" written on it. - person 4: a boy, green eyes, wearing a baseball cap, sitting - person 5: a girl, brown eyes, long hair
# background In the luxury interior. a king size bed, a disheveled sheets, a disheveled satin dress, a lamp on a bedside table, a pillow and a large window with a view from a high-rise building in the background.
# background ## left There is a bookshelf, many piled books and a potted plant. ## middle There is a window, curtains and a bed. ## right There is a book and a cup on a desk.
# style Highly detailed illustration style blending anime aesthetics with painterly realism.
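When prompts like these are assembled from data, a small helper keeps the heading-plus-bullets layout consistent. The sketch below is one way to do that. `section` is a hypothetical helper, and the "# heading / intro sentence / - items" layout is simply copied from the examples above.

```python
def section(heading: str, intro: str = "", items: tuple[str, ...] = ()) -> str:
    """Render one prompt section: a '# heading' line, an optional intro, then bullets."""
    lines = [f"# {heading}"]
    if intro:
        lines.append(intro)
    lines.extend(f"- {item}" for item in items)
    return "\n".join(lines)


def build_prompt(sections: list[str]) -> str:
    """Join rendered sections with newlines, in the order given."""
    return "\n".join(sections)


if __name__ == "__main__":
    prompt = build_prompt([
        section(
            "person",
            "There are two persons.",
            (
                "person 1 (left): a girl, short red hair, sitting on a stool",
                "person 2 (right): a boy, short brown hair, standing",
            ),
        ),
        section("style", "Anime style."),
    ])
    print(prompt)
```

Keeping each person or region as its own bullet makes it easy to add or remove subjects without rewriting the rest of the prompt.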
Edit
There are two kinds of editing prompt: one that describes only the result after editing, and one that specifies the difference. Z Image Edit supports both. Use the difference style to fix a specific spot in an image, and the result-only style when the input serves as a reference image.
- Result-only example: A girl with red eyes and white long hair is standing.
- Difference example: Change her eye color from blue to red, her hair length from short to long and her posture from sitting to standing.
Use a prompt enhancer
Have an LLM fill in the missing information. This works with both local and cloud models.
Structure the prompt entered by the user and output it *in English*. Where information is missing, the assistant may fill it in freely.
Choose one style of painting from: photo realistic, anime, illustration, digital painting.
Choose camera position from high angle, eye level, waist level, low angle.
Choose shot size from close up, medium shot, wide shot, full shot, long shot.
*If you choose close up or medium shot for shot size, delete the foot: item*.
For foot, choose either barefoot or shoes. If shoes, describe what kind and what color.
Choose location from indoor, outdoor.
Output only the edited, structured prompt. Output in English.
# User prompt
A girl is standing indoors.
# output
# style of painting
-
# camera
- camera position:
- shot size:
# person
- garment:
- foot:
- pose:
- hair length:
- facial expression:
# background
- location:
## left
-
## middle
-
## right
-
# settings
- time of day:
- season:
- weather:
- lighting:
The examples below were generated with an older version of this prompt, so some items are missing.
gpt-oss-20b output example
The user's input prompt was "A girl is standing on the rooftop of a building in a cyberpunk city, looking out at the view."
The instruction "Describe an uncommon pose, location, and situation." was added to the prompt.
# style of painting - anime # camera - camera position: low angle - shot size: medium shot # person - garment: sleek, reflective cyberpunk jumpsuit with embedded LED strips forming abstract patterns; a translucent trench coat drapes over her shoulders. - pose: standing on the rooftop railing, one hand resting on the metal rail, the other holding a small holo-map; head slightly tilted toward the horizon. - hair length: long, flowing hair interwoven with faint neon strands that pulse softly in sync with city lights. - facial expression: contemplative with a subtle, enigmatic smile, eyes reflecting distant neon signs. # background - location: outdoor ## left - towering skyscrapers piercing a misty night sky, drones and floating advertisement billboards hovering above the rooftops. ## middle - the girl herself, framed against the city skyline. ## right - a massive holographic billboard projecting an abstract digital art piece that casts shifting colors onto her silhouette. # settings - time of day: night - season: winter (frosted edges on the rooftop rail) - weather: light mist and rain droplets suspended in the air, catching neon reflections. - lighting: backlit by a myriad of neon signs; subtle rim lighting highlights her outline against the darkness.
Gemini 3 fast mode output example
The user's input prompt was "A girl is sitting outdoors in a city."
# style of painting - anime # person - garment: casual wear (jeans, hoodie) - foot: shoes (white sneakers) - pose: sitting on a bench, looking at her phone - hair length: medium - facial expression: calm, slightly focused # background - location: outdoor ## left - skyscraper ## middle - city park bench, street view ## right - street lamp, small cafe in the distance # settings - time of day: late afternoon - season: autumn - lighting: soft, natural light, slight golden hour glow
Z Image Engineer
A user-made model for prompt enhancement. Its data source is Gemini.
BennyDaBall/qwen3-4b-Z-Image-Engineer
Emoticons
Z-IMAGE-TRUBO-NEW-FEATURE DISCOVERED
- a girl making this face "{o}.{o}" , anime
- a girl making this face "X.X" , anime
- a girl making eyes like this ♥.♥ , anime
- a girl making this face exactly "(ಥ﹏ಥ)" , anime
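List of recognized characters
Some characters are not recognized in English but are recognized in Chinese (for example Frieren and Gojo Satoru).
- Dragon Ball
- NARUTO
- Sailor Moon
- Genshin Impact
  - Furina (芙宁娜)
- Hatsune Miku
- Pikachu
- Monkey D. Luffy
- Doraemon
- Conan Edogawa
- Tanjiro Kamado
- Anya (SPY x FAMILY)
- Frieren (芙莉莲)
- Gojo Satoru
- Sakata Gintoki (银魂)
- Rem (Re:Zero) (雷姆)
Style list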
- Style Preset Library
- Z-Image Style System Deep Dive: How to Achieve 70+ Art Styles Through Prompt Templates
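Gacha (randomizing results)
Z-Image (the non-Turbo model) changes the picture with the seed, so no tricks are needed.
Inserting random characters adds a little randomness. With the method that denoises the first step using the dpmpp_2m_sde sampler and an empty prompt, reducing the total step count increases the randomness.
- Put a random string such as "fdahoiuhroewa" into the prompt
- Use the SeedVarianceEnhancer custom node
- Use a two-stage sampler with Z-Image and Z-Image Turbo
- Denoise the first step with the dpmpp_2m_sde sampler and an empty prompt, then denoise the rest with the normal prompt
- Run t2i with strength at 0.7, then run i2i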
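The random-string trick from the list above is easy to script. The sketch below appends a seeded string of lowercase letters to the prompt so that each seed yields a repeatable variation. `add_random_suffix` and its defaults are hypothetical, and how strongly the model reacts to the suffix is not something this code can predict.

```python
import random
import string


def add_random_suffix(prompt: str, seed: int, length: int = 13) -> str:
    """Append a seeded run of random lowercase letters to the prompt."""
    rng = random.Random(seed)  # private generator, so the global state is untouched
    noise = "".join(rng.choices(string.ascii_lowercase, k=length))
    return f"{prompt} {noise}"


if __name__ == "__main__":
    base = "A girl is standing on a rooftop. Anime style."
    for seed in range(3):
        print(add_random_suffix(base, seed))
```

Reusing a seed reproduces the same suffix, which helps when you want to return to a variation you liked.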
Unlock diversity of Z-image-Turbo, comparison
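The post introduces a custom sampler setup like the one below. However, the workflow below produces poor image quality, so it needs to be followed by i2i with a normal workflow.
Raising the ModelSamplingAuraFlow value in the workflow above increases randomness but degrades image quality. BasicScheduler and SplitSigmas control the step count.
This workflow
- uses ModelSamplingAuraFlow to shift the timesteps toward more noise
- discards the first 5 of the shifted timesteps, the ones with the most noise
This denoises while leaving some noise behind. As a result image quality is poor (residual noise remains), but in exchange the randomness of the noise carries into the image.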
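The two steps above can be illustrated numerically. The sketch assumes the usual flow-matching time shift, sigma' = shift * sigma / (1 + (shift - 1) * sigma), which is my understanding of what ModelSamplingAuraFlow applies, and a linearly spaced base schedule standing in for the real scheduler. Both are assumptions and all function names are hypothetical.

```python
def shift_sigma(sigma: float, shift: float) -> float:
    """Assumed time shift: a larger shift moves sigma toward 1 (more noise)."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def shifted_schedule(steps: int, shift: float) -> list[float]:
    """Linearly spaced sigmas from 1 down to 0, each passed through the shift."""
    return [shift_sigma(1.0 - i / steps, shift) for i in range(steps + 1)]


def drop_high_noise(sigmas: list[float], skip: int) -> list[float]:
    """Discard the first `skip` (noisiest) sigmas, as a split after step `skip` would."""
    return sigmas[skip:]


if __name__ == "__main__":
    for shift in (1.0, 3.0, 6.0):
        kept = drop_high_noise(shifted_schedule(steps=9, shift=shift), skip=5)
        print(shift, [round(s, 3) for s in kept])
```

With a larger shift the first kept sigma is higher, so sampling starts its remaining steps from a noisier point. That matches the trade-off described above: more randomness, more residual noise.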
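SplitSigmas can be used to implement Automatic1111's Prompt Editing. Because SplitSigmas lets you switch models partway through and split into multiple stages, it allows more flexible control.
Resolution list
If the model is not drawing something well, try the following.
- Generate at 1,536 or below (the training resolution)
- If it draws correctly at a lower resolution, use an upscale workflow
- Change the aspect ratio
| Total pixels | 9:16 | 2:3 | 3:4 | 4:5 | 1:1 |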
|---|---|---|---|---|---|
| 1MP | 768 * 1344 | 832 * 1280 | 896 * 1152 | 896 * 1152 | 1024 * 1024 |
| 1.25MP | 832 * 1536 | 960 * 1408 | 960 * 1344 | 1024 * 1280 | 1152 * 1152 |
| 1.5MP | 960 * 1664 | 1024 * 1536 | 1088 * 1472 | 1152 * 1408 | 1280 * 1280 |
| 1.75MP | 1024 * 1792 | 1088 * 1664 | 1152 * 1536 | 1216 * 1536 | 1344 * 1344 |
| 2MP | 1088 * 1920 | 1152 * 1792 | 1280 * 1664 | 1280 * 1600 | 1472 * 1472 |
| 2.5MP | 1216 * 2176 | 1344 * 1984 | 1408 * 1856 | 1472 * 1792 | 1600 * 1600 |
| 3MP | 1344 * 2368 | 1472 * 2176 | 1536 * 2048 | 1600 * 1984 | 1792 * 1792 |
| 4MP | 1536 * 2752 | 1664 * 2496 | 1792 * 2368 | 1856 * 2304 | 2048 * 2048 |
| 5MP | 1728 * 3072 | 1856 * 2816 | 1984 * 2624 | 2048 * 2560 | 2304 * 2304 |
| 6MP | 1856 * 3328 | 2048 * 3072 | 2176 * 2880 | 2240 * 2816 | 2496 * 2496 |
| 7MP | 2048 * 3584 | 2240 * 3328 | 2368 * 3136 | 2432 * 3008 | 2688 * 2688 |
| 8MP | 2176 * 3840 | 2368 * 3520 | 2496 * 3328 | 2560 * 3264 | 2880 * 2880 |
| 9MP | 2304 * 4096 | 2496 * 3776 | 2688 * 3520 | 2752 * 3456 | 3072 * 3072 |
| 10MP | 2432 * 4288 | 2624 * 3968 | 2816 * 3712 | 2880 * 3648 | 3264 * 3264 |
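Sizes like those in the table can be generated for other megapixel counts or aspect ratios. The sketch below assumes 1MP means 1024 x 1024 pixels and that each side is rounded to the nearest multiple of 64. Both are assumptions inferred from the table rather than stated in it, and `resolution_for` is a hypothetical helper. It matches the table entries spot-checked in the example.

```python
import math


def resolution_for(megapixels: float, ratio_w: int, ratio_h: int, multiple: int = 64) -> tuple[int, int]:
    """Width and height for a pixel budget and aspect ratio, snapped to `multiple`."""
    total = megapixels * 1024 * 1024  # assumption: 1MP = 1024 * 1024 pixels
    width = math.sqrt(total * ratio_w / ratio_h)
    height = math.sqrt(total * ratio_h / ratio_w)

    def snap(value: float) -> int:
        # Round to the nearest multiple, never below one multiple.
        return max(multiple, round(value / multiple) * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    for mp, (rw, rh) in [(1, (9, 16)), (1, (2, 3)), (2, (1, 1)), (4, (1, 1))]:
        w, h = resolution_for(mp, rw, rh)
        print(f"{mp}MP {rw}:{rh} -> {w} * {h}")
```

Swap `ratio_w` and `ratio_h` for landscape sizes.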
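Upscaler
SeedVR2 is well regarded. The official weights are 32-bit and large, so use an fp8 version such as numz/SeedVR2_comfyUI.
Samplers and schedulers
"Perfect Z Image Settings: Ranking 14 Samplers & 10 Schedulers" compares samplers and schedulers, but none of them makes much difference.
To add more detail, you can
- increase the step count to 9 or more
- raise the resolution
- increase the ModelSamplingAuraFlow shift
- use the Owen777/UltraFlux-v1 VAE
Z-Image Step CFG
For the effect of step count and CFG, see the threads below.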
- big initial Z Image settings comparison: steps x CFG
- Z-Image Base - Schedulers/Samplers - What's your go to? (scroll through)
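My impression is that CFG 5 with 20 steps is enough. With CFG above 1 each step evaluates the model twice, so 20 steps cost 40 model evaluations against Turbo's 8 (CFG 1, 8 steps), which makes Z-Image 5x slower than Turbo.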
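The 5x figure follows from counting model evaluations. The sketch below assumes that a step costs one evaluation at CFG 1 and two evaluations when CFG is above 1, and that nothing else dominates the runtime. `model_evaluations` is a hypothetical helper.

```python
def model_evaluations(steps: int, cfg: float) -> int:
    """One model call per step, or two when CFG is above 1 (conditional + unconditional)."""
    return steps * (2 if cfg > 1 else 1)


if __name__ == "__main__":
    z_image = model_evaluations(steps=20, cfg=5)
    turbo = model_evaluations(steps=8, cfg=1)
    print(f"Z-Image: {z_image} evaluations, Turbo: {turbo}, ratio: {z_image / turbo:.1f}x")
```

The same count shows why adding a negative prompt to Turbo (CFG above 1) roughly doubles its generation time.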
Examples
Images not marked as Z-Image were made with Turbo. The sampler is Euler and the scheduler is simple.
See the links below for the examples.
There are three girls in a room. The girl on the left has short red hair and blue eyes. She is sitting on a stool holding a card with the word "左" written on it. The girl in the middle has long silver hair and red eyes. She is standing and holding a card with the word "中" written on it in both hands. The girl on the right has medium brown hair and green eyes. She is sitting on a stool and holding a card with the word "右" written on it. There are potted plants and a kitchen in the background. Anime style.
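It satisfies every instruction in the prompt, the image quality is good, the hands have not melted, and the background holds together. It has by far the best baseline performance among lightweight models.
A mix of markdown and a tag list like the following also works.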
There are three girls in a room. # person There are three girls. - left: short red hair, blue eyes, sitting on a stool, holding a card with the word "左" written on it. - middle: long silver hair, red eyes, standing, holding a card with the word "中" written on it in both hands. - right: medium brown hair, green eyes, sitting on a stool, holding a card with the word "右" written on it. # background potted plants, a kitchen. # style Anime style.
There is a piece of paper with "あいうえおアイウエオ かきくけこカキクケコ さしすせそサシスセソ たちつてとタチツテト なにぬねのナニヌネノ はひふへほハヒフヘホ まみむめもマミムメモ やゆよヤユヨ らりるれろラリルレロ わをんワヲン" written on it.
It can draw kanji that also exist in Chinese, but it cannot draw hiragana or katakana.
Anime style. # person 芙莉莲's upper body. - appearance: silver hair and green eyes - facial expression: closing one eye with slight smile - objects: She is holding a white board with handwritten "It works!" - garment: a coat with a red jewelry on her neck button # background The background features a snowy night with bokeh.
A digital painting of Furina(芙宁娜). Negative prompt: 3d, realistic
Japanese anime style illustration of Mutsuki from Blue Archive, full-body portrait, standing gracefully with long flowing grey-white hair and vibrant purple-pink eyes, gazing directly at viewer with a sweet, closed-mouth smile. Wears an elegant fusion garment combining elements of pink Hanbok and Kimono: wide, floor-length sleeves, layered flowing skirt, accented with satin pink ribbons and small artificial hair flowers nestled near her temple. Holds in both hands a large, lush bouquet composed entirely of fresh pink cherry blossoms with visible stamens and dewdrops. Above her head, her signature circular halo emits a soft, constant glow with subtle radial gradient, casting no shadows. Background: dreamlike night scene under a massive, luminous full moon positioned upper-center, bathing entire scene in cool, diffused silver-blue moonlight interwoven with ambient pink-tinted atmospheric haze. Countless ethereal cherry blossom petals float mid-air in gentle, swirling trajectories; one small songbird with detailed feathers flies diagonally left-to-right near shoulder height; three butterflies flutter nearby — two with iridescent purple wings, one with bright yellow wings, all rendered with translucent wing membranes and fine vein patterns. Top-right corner: the Chinese characters "春意盎然" rendered in traditional brush calligraphy style, pure white ink on semi-transparent background layer, stroke texture visible, characters aligned vertically, each approximately 1/8 the height of the total canvas, positioned 5% inward from top and right edges. Color palette dominated by soft pinks, lavenders, and moonlit blues; fabric shows realistic silk sheen and fold dynamics; petals exhibit slight translucency and edge glow; ground surface implied by shallow depth-of-field blur but absent solid terrain — figure appears suspended in celestial air. Composition balanced with vertical emphasis, Mutsuki centered slightly below midpoint, moon and text forming diagonal counterweight. 
No lens flare, no bloom effects beyond natural glow sources. Negative prompt: 3d,realistic
# style Pure illustration. The soft lighting. # character There are two girls in a room. One is standing, the other is sitting on a stool. ## character 1 A long-haired girl with blue hair wearing a white dress and knee boots is standing facing forward on the left. She hides her hands behind her back. ## character 2 On the right, a short girl with long red hair is sitting on a wood stool with her legs spread and leaning forward. Her arms are placed in front of her and her hands on the stool. She is wearing a black dress and knee boots. She is facing front. Her height is short. # background In the background are a window, curtains, a table and a potted plant.
hanagata,Fiction illustrations, akengihci, simple, A solo young girl with blue hair and blue eyes, wearing a white shirt with short sleeves and a black choker. She has earrings and makeup, with a short ponytail. Her closed mouth shows a subtle expression as she looks directly at the viewer. The simple white background highlights her elegant jewelry and clean style. Negative prompt: 3d,realistic
A digital artwork of exquisite detail. A fantastical woman with long, silver-white hair and clear, beautiful eyes. The artwork emphasizes the textures of various materials: the softness of feathers on her headdress, the warm luster of pearls, and the sharp glint of crystal jewelry. Her skin is smooth and delicate, her lips moist. Strong contrasts of light and shadow highlight every detail, from individual strands of hair to the glitter in her eye makeup. The background is deep darkness, filled with shimmering magical particles. Negative prompt: 3d,realistic
Anime key visual. A group of girls playing beach volleyball. The main character is jumping for a spike, dynamic mid-air pose. She is wearing a revealing string bikini that defies physics. Sand flying, water splashing, detailed anatomy. # background high contrast sunlight, sky, sea, valleyball pole, valleyball net
Prompt source: https://www.reddit.com/r/StableDiffusion/comments/1p7a800/comment/nqwm0wx/
A hyper-detailed, maximalist anime illustration that looks like a stained-glass window made of neon lights. The art style is strictly flat, vector-like 2D with zero realistic textures. The scene features a mysterious anime girl floating in a void of floating geometric shapes, playing cards, and giant ticking clocks. The composition is a dizzying spiral that draws the eye to the center. The character has rainbow-gradient hair that has no individual strands but acts as a solid shape with hard-edged highlights. Her eyes are incredibly detailed with multiple concentric rings of color. The color palette is an explosion of cyan, hot pink, and lime green, with absolutely no gray or brown tones. The shadows are colored (e.g., purple shadows on pink skin) rather than black. The background is packed with intricate patterns—checkerboards, polka dots, and Japanese traditional wave patterns—all fighting for attention but perfectly balanced in a "pop art" anime aesthetic.
The illustration of a chibi girl sitting in a chair eating a piece of pizza. She is saying "This is fine!" in a speech bubble. There is a table and a window in the indoor room.
Anime-style fan art character illustration of a fairy warrior, a confident and magical female character with long flowing pink hair, soft gradients and dynamic strands, smiling with calm confidence and inner strength, bright expressive anime eyes, holding an elegant magical staff topped with a glowing crystal, wearing a detailed fantasy outfit that blends light armor with flowing fabric, subtle magical runes and ornaments, delicate fairy wings shimmering softly behind her, heroic yet graceful pose, vibrant fantasy color palette, dynamic lighting that highlights her face and staff, clean anime linework with high-detail shading, polished fan-art illustration style, magical atmosphere, pure illustration (not realistic), 16:9 aspect ratio
Prompt for the face detailer
Anime-style fan art character illustration of a fairy warrior, a confident and magical female character with long flowing pink hair, soft gradients and dynamic strands, smiling with calm confidence and inner strength, bright expressive anime eyes, vibrant fantasy color palette, dynamic lighting that highlights her face, clean anime linework with high-detail shading, polished fan-art illustration style, magical atmosphere, pure illustration (not realistic)
Prompt source: https://www.reddit.com/r/StableDiffusion/comments/1qgcuso/flux2_klein_vs_flux2_vs_zimage/
anime style young woman, slender body, long white hair with bangs and an elaborate frilly headpiece, fair skin, large red eyes, wearing a black and white frilly gothic lolita dress with intricate ruffled details, posing with hands together near her face, adorned with a silver cross earring and a silver ring, against a plain light background, mood is solemn and elegant, soft lighting highlighting the delicate textures of the frills and hair, close-up shot focusing on the character's detailed features and accessories
A young girl with long, black hair and bangs, wearing delicate silver earrings and a thin necklace, looks calmly and slightly shyly at the camera. Highly detailed digital pained illustration, amine style. # ID card She holds up an ID card with her right hand. The card is the same size as a credit card. The ID card has following information: - her portrait that she wears a black top and has a serious expression - name "極楽夢衣" - date of birth "2010年04月23日" - address "東京都千代田区永田町1丁目7−1" - ID number "765-8766196" # background There is a red wooden furniture and white walls faintly visible in an indoors. # style Highly detailed digital pained illustration, amine style.
A young girl with long, black hair and bangs, wearing delicate silver earrings and a thin necklace, looks calmly and slightly shyly at the camera. Highly detailed digital pained illustration, amine style. # ID card She holds up an ID card with her right hand. The card is the same size as a credit card. The ID card has following information: - her portrait that she wears a black top and has a serious expression - name "极乐梦衣" - date of birth "2010年04月23日" - address "东京都千代田区永田町1-7-1" - ID number "765-8766196" # background There is a red wooden furniture and white walls faintly visible in an indoors. # style Highly detailed digital pained illustration, amine style.
A illustration of close-up profile of a young woman with ethereal, almost otherworldly qualities. The subject has pale, flawless skin and striking yellow eyes that are slightly downcast, giving her an introspective appearance. Her long, wavy hair is a silver-gray color, flowing dynamically and appearing almost as if its caught in a gentle breeze, adding a sense of movement to the image. She has full, red lips that contrast sharply with her pale complexion. She is wearing a white garment with a red collar, which is slightly visible at the bottom of the frame. # background The background is a dark, gradient gray, which helps to highlight the subjects luminous features and hair. # lighting The lighting is soft and diffused, casting gentle shadows that enhance the three-dimensionality of her face and hair. The overall mood of the image is serene and mystical, with a focus on the delicate and ethereal qualities of the subject. # style This is a digital painting in a realistic, semi-abstract style. Use smooth, blended brushstrokes to create a sense of realism while maintaining a slightly fantastical atmosphere.
Prompt source: Flux 2 K 9b 25.01.26
A gritty fantasy illustration of a --- An redhead girl with arrogant face expression, heavy makeup, smokey eyes, from side, tilted head, bright eyes, UwU, black choker, long silver earrings # tags Concept art, rough texture, hatching, colorful, made with soft pastel inks,
Anime style. An Anime girl with realistic textures reclines in a cozy, tech-infused workspace bathed in a cool, ambient blue light. She has silky grey hair styled into a high ponytail secured with a black bow, with sleek, reflective black sunglasses resting atop her head—shimmering with the glow of the screen behind her. Her outfit blends casual and edgy vibes: a white crop top with bold blue "RIDE" lettering across the chest, layered under a black zip-up jacket featuring plush blue fur trim on the collar and metallic buttons that catch the light. Fingerless black gloves encase her hands, one holding a white coffee mug emblazoned with the MIT logo—steam delicately curling upward from the rim, hinting at warmth. Her lower body showcases tight black legwear: high-waisted shorts with adjustable straps that cinch her thighs and waist, emphasizing her relaxed pose as she lounges with legs playfully bent. Her left hand rests near her cheek, fingers softly grazing her skin, eyes half-lidded in a serene, contemplative expression that conveys calm focus. The background reveals a modern workspace: a monitor displays an anime-style character with frosty, futuristic details, a sleek keyboard and mouse sit beside it, and wireless speakers with glowing blue LED accents add to the digital ambiance. The soft blue lighting casts gentle shadows across smooth skin, silky hair, and textured fabric, enhancing the cozy yet stylish atmosphere of a personal creative sanctuary. Every detail—from the subtle sheen of the mug to the precision of the jacket’s stitching—blends anime’s vibrant storytelling with realistic rendering, creating an immersive scene where comfort and tech culture harmonize. Anime style.
Against a light blue sky, a large white long-haired cat (with gradient deep blue eyes and a light blue collar) leans against a girl's profile, both looking up in unison. The girl has slightly curly short hair, cherry-shaped earrings, and wears light blue and white clothing. The scene is filled with pink, yellow, and blue petals and heart-shaped light spots, the colors are clear and bright, and the overall atmosphere is healing and dreamy, full of the fresh feeling of spring. (Pastel illustration style)
A split-screen composite portrait of a full body view of a single Chinese woman with opening mouth, front view. The image is divided vertically down the exact center of her face. The left half is a black and white rough anime pencil sketch style, the right half is digital painting. The facial features align perfectly across the center line to form one continuous body. Seamless transition.
The game screen capture style. The Genshin Impact character, Hu Tao(胡桃), is depicted in a lively pose on the right side of the image. She wears a black outfit with red and white accents on the sleeves and skirt lining. Her long, dark brown hair flows in the wind and she wears a distinctive flower-adorned hat. Her right hand is near her face, and in her left she holds a glowing red polearm. A distinctive white ghost floats at her feet. The background is a rich autumn landscape, with shrubs and trees colored in orange and red, creating depth. The ground is covered with mossy rocks and lush vegetation, and is decorated with a small lantern and plants with pink flowers. A warm light shines throughout, highlighting the character.
This illustration combines pen and ink drawing with watercolor techniques, featuring highly saturated colors. A girl sits on a chair, smiling as she holds chopsticks. A vibrant, an abundant tuna and salmon sushi and sashimi platter neatly arranged within a simple, light brown cardboard takeaway box. The subtle texture of the box's matte surface provides a minimalist backdrop, allowing the vibrant colors of the sushi to truly pop. The left side features a meticulously arranged, full tuna and salmon sashimi rose with delicate, orange-pink layers, accompanied by two simple, thick slices of salmon. To the right, a delightful array of individual salmon-wrapped rice balls are generously topped with swirls of creamy white cream cheese and garnished with bright green, finely chopped scallions or chives. Below these, two organized rows of inside-out salmon and cream cheese sushi rolls (uramaki) showcase their fluffy white rice exteriors, heavily speckled with contrasting black sesame seeds, revealing the rich salmon filling in their cleanly cut cross-sections. The scene is illuminated by soft, diffused natural light, creating gentle highlights on the creamy fillings and the glossy salmon, enhancing the appetizing appeal without harsh shadows. The edges of the box are slightly visible, grounding the scene and emphasizing its 'to-go' nature. The box is placed on a table. There is a soy source in a saucer on the left side of the box.
# Core Feature - Character Type: A cute female figure in a chibi (Q-style) aesthetic. - Facial Features: Large red eyes, a small nose and mouth, long, wavy golden hair, and bangs. - Headwear: A large bow on her head, made of white, gray, and dark blue striped fabric. A blue cross and two red lines are affixed to her forehead. - Pose: Left hand waving, right hand making a "V" sign. # Clothing Details - Overall: Wears a white, multi-layered ruffled dress. - Neckline and Cuffs: Delicate white lace trim on the neckline and cuffs. - Chest Accessory: A blue and white striped bow tied at the chest, with a red round gemstone in the center. - Pendant: Two dark blue ribbons hang from the bow, each ending in a gold cross pendant. - Other: Gold buttons adorn the dress. # Background Elements There is a character in the blurry background. - Environment: Another character is faintly visible in the background. This character has golden hair and black demon/bat-wing-like horns, and wears black clothing. # Style and Texture - Style: Japanese anime style, emphasizing cuteness and refinement. - Materials: The texture of the clothing fabric, the delicacy of the lace, and the luster of the metallic pendants all need to be depicted.
Atmospheric anime illustration of a beautiful young woman with long flowing black hair standing on an old school staircase, looking back over her shoulder directly at the viewer with an enigmatic expression. Character wears a classic school uniform consisting of a white blouse with navy blue tie, dark pleated skirt, black knee-high socks and leather shoes. Dramatic cinematic lighting from above creates strong contrast between illuminated figure and shadowy staircase surroundings, casting sharp shadows across the tile walls. Environment features worn ceramic tiles in checkerboard pattern on walls and steps, wrought iron handrail with peeling paint, and a vintage directional sign mounted on the wall showing Japanese characters for "出口". Thick-coated 3D texture rendering with visible brushstrokes creates tangible surface quality on tiles, fabric and metal. Cool color palette dominated by deep blues and grays with strategic highlights in warm tones on skin and hair. Highly detailed illustration style blending anime aesthetics with painterly realism, emphasizing the visual impact of light and shadow interplay. Mysterious, contemplative mood with slight gothic undertones, professional digital painting with exceptional material rendering.
A girl holding an M4 assault rifle in her right hand is standing in an apocalyptic city. She is looking to the side. She is wearing a hooded jacket, a skirt and boots with a backpack. # background The background is a dark, gradient snowing gray. Draw the following objects: - disfigured road - disfigured buildings - many broken cars, tanks - debris and rubble - a broken traffic light # style This is a digital painting in an anime but semi-realistic style. Dramatic cinematic lighting and mysterious, contemplative mood. Draw the M4 assault rifle in detail.
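Z Image Turbo tends to break hands and guns, so this image took roughly 100 generation attempts. Defects like these are faster to fix with Nano Banana Pro or an Edit model.
Qwen Image Edit was less prone to broken hands. Qwen Image Edit follows the prompt more faithfully, but Z Image Turbo delivers better image quality and richer detail.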
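When an image needs many attempts, the retries can be prepared in bulk instead of being queued by hand. The following is a minimal sketch, assuming the workflow was exported from ComfyUI in API format as a JSON-like dict and that the sampler node exposes a `seed` input. The function name `make_seed_variants`, the node id `"3"`, and the sample workflow are hypothetical placeholders, not values taken from this article.

```python
import copy
import random


def make_seed_variants(workflow, sampler_node_id, count, rng_seed=0):
    """Return `count` copies of `workflow`, each with a different sampler seed.

    Assumption: `workflow` is an API-format dict in which
    workflow[sampler_node_id]["inputs"]["seed"] holds the sampler seed.
    """
    rng = random.Random(rng_seed)
    # random.sample guarantees that the drawn seeds are distinct.
    seeds = rng.sample(range(2**32), count)
    variants = []
    for seed in seeds:
        variant = copy.deepcopy(workflow)  # leave the original workflow untouched
        variant[sampler_node_id]["inputs"]["seed"] = seed
        variants.append(variant)
    return variants


# Hypothetical, heavily trimmed workflow: only the sampler node is shown.
example_workflow = {
    "3": {"class_type": "KSampler", "inputs": {"seed": 0}},
}

variants = make_seed_variants(example_workflow, "3", count=10)
print(len(variants), "workflows prepared")
```

Each variant can then be queued one after another, and the best result picked by eye.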
# composition ## right A girl with long flowing black hair standing on the right side of the table. The girl wears a classic school uniform consisting of a white blouse with navy blue tie, dark pleated skirt, black knee-high socks. ## middle There is a round mini table in the center of the image. There is an empty coffee cup on the table. ## left There is a bookshelf with piled books in it. # background In the living room, there are a window, curtains, bookshelves, piled books, a table, and a potted plant. # style Atmospheric anime illustration. The soft lighting creates a humorous atmosphere.
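A vague prompt such as "there is a person on the right side of the frame" is ignored, and the person is placed in the center of the frame. The composition has to be specified concretely.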
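The sectioned layout of the prompt above (`# composition` with `## right` / `## middle` / `## left`, then `# background` and `# style`) can be assembled programmatically. The following is a minimal sketch; the helper name `build_prompt` and its parameters are hypothetical and only mirror the headings used in this article.

```python
def build_prompt(composition, background, style):
    """Assemble a sectioned prompt from per-region descriptions.

    `composition` maps a region name (e.g. "right", "middle", "left")
    to the description of what should appear in that region.
    """
    parts = ["# composition"]
    for region, description in composition.items():
        parts.append(f"## {region}")
        parts.append(description.strip())
    parts.extend(["# background", background.strip()])
    parts.extend(["# style", style.strip()])
    return "\n".join(parts)


prompt = build_prompt(
    composition={
        "right": "A girl with long flowing black hair standing on the right side of the table.",
        "middle": "There is a round mini table in the center of the image.",
        "left": "There is a bookshelf with piled books.",
    },
    background="In the living room, there are a window, curtains, and a potted plant.",
    style="Atmospheric anime illustration.",
)
print(prompt)
```

Keeping each region in its own entry makes it easy to check that every part of the frame has an explicit description.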
A young woman is standing on a subway platform. She has short black hair and is wearing a light gray cap, a light colored cropped top and short black pants. She is holding an iPhone in her left hand. # background There are signs, Braille blocks and blurred figures, with a sense of urban transit in the subway station. There is a sign with "出口" written on it. # style japanese manga, line art, pen art, black and white, halftone
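Hands look questionable when the character holds an object. The model knows what Braille blocks (tactile paving) are but does not draw them well.
The character is also placed too high in the frame. Her head should be aligned with the heads of the figures in the background.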
Style
The examples come from the post "I continue to be impressed with Z-Image-Turbo's style flexibility." The prompts are at https://pastebin.com/ZbmMjrzF
Girl with long wavy hair wearing a pink tank top and black shorts in hand-drawn manga art style, casually leaning against a stone wall and holding a cigarette, slight smirk, black and white manga illustration, city skyline with glowing lights and a vibrant sunset in the background, heavy screentone shading on the wall, clothing, and smoke, strong rim lighting from the sunset illuminating her side profile, medium long shot composition with the city slightly blurred in the background, pen and ink lineart, rebellious and dramatic atmosphere, traditional manga panel aesthetic with speed lines emanating from the cigarette smoke
Long prompt:
**CHARACTER DESIGN: “THE TWO GIRLS” — STYLIZED FOR ANIME AESTHETIC** — **CHARACTER 1 (LEFT — CAMERA GIRL)** *Stance:* Upright, centered in frame. Dynamic pose: right arm raised holding a sleek black camera at eye level; left hand on hip, thumb visible. Slightly tilted posture to suggest focus and intent. *Facial Design:* - Smooth, light-medium skin tone with flat cel shading — no pores or texture. - Large, expressive eyes in light brown/hazel with thick black eyelashes — simplified but emotive. - Small, neat nose and full lips drawn with clean lines — neutral expression rendered through minimal line work. - Hair: short, dark brown blunt cut with crisp edges — frames face symmetrically using thick black outlines. No strands or flyaways. *Body & Attire:* - Simplified torso with exaggerated waistline — no anatomical weight, just stylized curves. - Bikini bottoms: bold orange and white stripes with thin black waistband — flat color blocks, no fabric texture. - Lower abdomen exposed as clean negative space — no pubic hair detail, just smooth contour lines. *Pose Emphasis:* - Arm gesture exaggerated for visual energy — camera held high, fingers curled slightly into stylized “holding” shape. - Hip hand creates balance — thumb shown as a small, clean oval with a single line for definition. — **CHARACTER 2 (RIGHT — LEANING GIRL)** *Stance:* Leaning against wall, angled toward left. Head tilted slightly — playful smirk implied through a small curve in the mouth and one eyebrow raised. *Facial Design:* - Rounded face with smooth, matte skin tone — flat cel shading. - Dark brown eyes looking directly forward — thick lashes and almond-shaped pupils with minimal detail. - Full lips drawn in a confident, curved line — no depth or texture. - Hair: dark brown pulled back into messy bun with patterned black headscarf — thick outline around the scarf, minimal strands inside. 
*Body & Attire:* - Bikini bottoms: solid black with thin waistband — smooth, stretchy silhouette stylized as a single curved shape. - Lower abdomen exposed with clean contour lines — no biological detail, just simplified silhouette. *Pose Emphasis:* - Left arm relaxed against wall — fingers loosely drawn as simple arcs. - Right hand on abdomen — stylized palm with clean lines, no texture or realism. — **STYLIZATION RULES APPLIED:** — **ANTI-REALISM FILTER (FULLY ENFORCED):** • All references to pores, skin texture, sweat, veins — removed. Replaced with smooth cel-shading and hard-edge shadows. • Fabric texture, creasing — replaced with flat color blocks and simplified folds. • Pubic hair — removed entirely; lower abdomen rendered as clean negative space with stylized contour lines. • Lighting: flat, even illumination — no subsurface scattering or complex shading. Shadows are hard-edged and minimal. — **STYLISTIC DEFINITION (HAND-DRAWN ANIME CEL):** • Thick, bold black ink outlines define all character contours — crisp edges with no soft blending. • Interior details: thin, clean linework for facial features, clothing seams, and accessories — no shading gradients. • Colors: flat, saturated hues with minimal variation — orange stripe contrast against skin tone and black bikini. • Background: simplified white walls with minimal detail — door handle as a thin, geometric shape. — **SETTING & ATMOSPHERE (STYLIZED):** — *Location:* Bathroom hallway — walls and door rendered as flat, smooth surfaces with thick outlines. • Walls: plain white — no patterns or textures. Door handle visible as a simple black-and-white shape on the left edge of frame. • Floor: not shown — only implied by negative space and character positioning. *Lighting:* Bright, even — no shadows cast on walls or floor. All elements lit with flat illumination to enhance cel-shading. 
— **COMPOSITION & TECHNICALS:** — *Camera Angle:* Eye-level, slightly above head — captures torsos and upper legs in tight medium shot. • Framing: minimal background, girls occupy most of the frame — negative space balanced on either side. • Color Palette: Neutral base (white, beige) with accent colors — orange stripes and black bikini. • Style: “Hand-drawn anime cel” with exaggerated forms, simplified anatomy, and bold outlines. — **FINAL NOTES:** This design sheet removes all realism while preserving the emotional tone, pose dynamics, and color contrast. The characters are rendered as stylized anime figures — expressive, clean-lined, and visually dynamic without biological detail. The environment supports the characters with minimalism and flat shading — perfect for animation production. — **END OF STYLE SHEET**
This image features a striking, stylized anime illustration of a cute girl sitting by the sea. # Character & Pose - Appearance: The girl has short, messy black hair that appears windblown. She has pale skin and bright blue eyes, looking upward in profile with a calm, contemplative expression. She is blowing out cigarette smoke softly. - Pose: She is seated on a dark, rocky outcrop at the edge of the water. Her right hand rests on the rock for support, while her left hand is raised, holding a lit cigarette near her face. Her legs are bent, with her right knee pulled slightly toward her chest. # Attire & Accessories - Swimwear: She is wearing a simple black string bikini. The top features a prominent white skull and crossbones graphic on the right cup. - Footwear: In a stylistic contrast to her swimwear, she is wearing black high-top sneakers (resembling Chuck Taylor All-Stars) with white laces and white rubber toes. - Item: She is holding a cigarette with a thin trail of white smoke curling upward into the sky. # Environment - Background: The setting is a minimalist coastal scene. The blue sea meets a light, pale blue sky at a sharp horizon line. Small white ripples suggest gentle waves hitting the shore. # Style - very pale color - Skin tones: white skin, with subtle peach shading - hand-drawn manga art style - pen and ink lineart
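Gymnastics prompts
These rarely succeed on the first attempt and usually need several retries. For "5. Standing Split With Forward Fold", the prompt was revised and more than 10 images were generated.
Prompt source: https://www.reddit.com/r/StableDiffusion/comments/1qhv0g1/flux_klein_gives_me_sd3_vibes/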
1. Revolved Half Moon Balance
A woman standing on one leg, full body visible, the supporting foot planted firmly on the ground while the opposite leg extends straight backward at hip height. Her torso twists sideways toward the camera, one hand reaching down to touch the floor while the other arm stretches vertically upward. Spine visibly rotated, shoulders stacked unevenly, hips misaligned by design. Tight athletic clothing clearly showing leg separation, knee alignment, ankle angle, and the twist of the waist. Camera at waist height, slight three-quarter angle, clean studio lighting revealing exact limb positioning.
2. One-Legged Crow Transition
A woman balanced low to the ground in a yoga arm balance. Both hands planted flat on the floor, elbows bent at sharp angles, shoulders leaning forward. One knee rests against the upper arm while the opposite leg extends backward fully off the ground. Head slightly lifted, neck extended forward. Weight distribution clearly visible through shoulder compression and wrist angle. Full body in frame from a low side angle, emphasizing arm strain, bent joints, and asymmetry between legs.
3. Deep Backbend Dropback
A woman standing upright mid-transition into a deep backbend. Knees slightly bent, hips pushed forward, spine arched dramatically backward. Head tilted fully behind her with face upside down relative to torso. Arms reaching behind toward the floor but not yet touching. Rib cage lifted, abdomen stretched, pelvis visibly angled forward. Shot from the side at chest height, strong directional lighting highlighting spinal curvature and torso deformation under tension.
4. Twisted Seated Bind
A woman seated on the ground with one leg folded under her and the other bent across her body. Torso twisted sharply in the opposite direction of the legs. One arm wraps behind her back while the other reaches around the front to clasp the wrist, forming a closed bind behind her torso. Shoulders uneven, spine corkscrewed. Camera positioned slightly above, looking down to emphasize overlapping limbs and hidden joints. Clear visibility of hand placement, elbow direction, and torso rotation.
5. Standing Split With Forward Fold
A woman bending forward but her left leg is straight upward. She is wearing a black highleg leotard. # background and the camera Camera directly from the side to expose hip misalignment, leg separation, knee locking, and foot orientation. Neutral background, sharp lighting, no motion blur.
6. King Pigeon
The prompt comes from the post "Z-Image Base test images so you don't have to".
A high-detail, full-body shot of a professional yoga instructor performing a complex "King Pigeon" pose on a wooden deck at sunrise. She is sitting with her spine deeply arched, one arm reaching back to grasp her upturned foot, and the other hand resting on her knee. She is wearing tight, charcoal-gray ribbed leggings and a sports bra, revealing the natural musculature of her core and shoulders. The morning sun creates a rim light along her body, highlighting the skin texture and muscle definition. # style Photorealistic, perfect anatomy, balanced proportions. Shot on Sony A7R V, 50mm f/1.2 GM lens, aperture f/2.0, 1/500s shutter, ISO 100. Crisp focus on the subject with a soft, sun-drenched coastal background.
External links
advanced prompt adherence: Z image(s) v. Flux(es) v. Qwen(s)













































