Z Image Turbo を ComfyUI で実行する＋プロンプトガイド

カテゴリ：deeplearning

Z Image Turbo はベースモデルではなく蒸留モデル（DMD）であることに注意。

Automatic1111 互換の forge-neo でも実行できる。

Qwen Image Edit や FLUX.2 は編集も生成もできるが、Z Image はそうではない。

LoRa 作成用モデル

Z Image Omni Base：生成と編集の事前学習のみ。ファインチューニングも強化学習もされておらず画質が良くない。大規模追加学習向け
Z Image：ファインチューニングはされているが、強化学習・RLHF はされていない
Z Image Edit：共通事前学習・Edit 用事前学習・Edit 用ファインチューニングが施されたバージョン。蒸留はされていないのである程度のステップ数が必要。論文によると CFG ありで 50 ステップ

生成用モデル

Z Image Turbo：ファインチューニング・蒸留・強化学習・RLHF が適用されたフルスペック版。Z Image は蒸留と強化学習とを同時に行う DMDR を採用している

Z-Image VS Turbo

Turbo は実写の画像生成のリアルさに強い。シードで多様性は出せないが、破綻した画像を出す確率が低い。

Z-Image（Turbo でない方）は多様なアニメ・イラストスタイルが表現できる。実写の品質もそこそこ高く、シードで多様性が出せる。ただし推論に時間がかかる（Turbo のおよそ５倍）。

Turbo と Z-Image との比較は Z-Image Base VS Z-Image Turbo を参照。

特徴

公式モデルなので当然 NSFW やキャラ、アーティストは認識しない。

軽量モデルの中ではダントツの基本性能

手が溶けづらい
背景が破綻しづらい
英語と中国語ならプロンプトの指示に忠実
高画質
背景のよほど小さいオブジェクト等でない限り Detailer が不要

モデルサイズ 6B と比較的軽量。

2.6B Lumina-Image 2.0
2.6B SDXL（U-Net のみ）
8B SD3
20B Qwen Image Edit
32B FLUX.2

2k 出力可能

学習時の解像度は 512～1,536

RTX3050 で 2,048 x 1,536 の画像を 150 秒で生成可能
ヌードの出力が一応可能
英語と中国語はテキスト描画可能
プロンプトに日本語が使える
背景が上手い

欠点

プロンプト

プロンプトは英語か中国語で書いた方が正確。テキストエンコーダーの日本語理解力は低い
英語と中国語の描画能力は高いが、漫画を描かせるとテキストに間違いが多い

日本語の描画は一部対応
NSFW の絡みは不可
キャラ、アーティストは認識しない
1,536 を超える解像度は、超えた部分の画像が不安定になる。2k を超えると、超えた部分はほぼ破綻する

アップスケールワークフローで 4MP 程度までは対応可能

必要スペック

モデル（12 GB）＋テキストエンコーダー（8 GB）＋VAE（0.3 GB）で 20 GB なので RAM 32 GB で足りる。

最新の ComfyUI は VRAM の量は重要ではなくなっている。なぜなら RAM にモデルをロードして、必要な分を VRAM に転送して処理するようなアルゴリズムになったからだ。画像生成 AI は演算ボトルネックなので、RAM からモデルを転送しつつ推論しても生成速度はほとんど低下しない。

ComfyUI のバージョンは 0.3.75。

画像の VRAM 使用量

512 x 512 = 0.1 GB。

解像度	VRAM使用量（GB）
512 x 512	0.1
1024 x 1024	0.4
1024 x 1536	0.6
1536 x 1536	0.8
1536 x 2048	1.2
2048 x 2048	1.6

モデル

Z-Image Turbo

配置場所	モデル
models/unet	z_image_turbo_bf16.safetensors
models/text_encoders	qwen_3_4b.safetensors
models/vae	ae.safetensors

ワークフローは ComfyUI_examples の画像を ComfyUI にドラッグする。

そのほかのモデル

モデル	URL
AIO	SeeSee21/Z-Image-Turbo-AIO
VRAM８GB向けの量子化（fp8 と INT8）	lightx2v/Z-Image-Turbo-Quantized
テキストエンコーダーのGGUF	Qwen/Qwen3-4B-GGUF
実写モデルのディティールを強化する VAE	Owen777/UltraFlux-v1
ノイズを減らしてアニメ向けにする VAE	Anzhc/Z-Image_Anime_VAE
追加の蒸留で２ステップ生成するモデル	inclusionAI/TwinFlow-Z-Image-Turbo

RTX 4000 番台以降を使用している場合、fp8_scaled を使うと推論速度が 50% 以上高速になるのでおすすめ。RAM が少ない環境なら GGUF や DFloat11 が選択肢に入る。

RTX 5000 番台以降は nunchaku の nvfp4(fp4) が使える。推論速度がおよそ２倍高速になる。ただし LoRa が使えない。

DFloat11 は bf16 をロスレス圧縮し、モデルのメモリ使用量を 30% 削減する。VRAM 12 GB に Z Image のモデルをすべて載せられる。しかし DFloat11 は生成速度は低下する可能性が高い。

量子化	URL
GGUF	jayn7/Z-Image-Turbo-GGUF
fp8_scaled	Kijai/Z-Image_comfy_fp8_scaled
fp8_scaled	Z-image Turbo [TensorCoreFP8]
DFloat11	z_image_turbo_bf16-DF11.safetensors
nunchaku 公式 int4	svdq-int4_r256-z-image-turbo.safetensors
nunchaku 公式 nvfp4(fp4)	svdq-fp4_r128-z-image-turbo.safetensors

nunchaku の品質比較は Z-Image Turbo BF16, NVFP4, Nunchaku Basic Comparison を参照。

Z-Image

モデル	URL
公式	Tongyi-MAI/Z-Image
jayn7/Z-Image-GGUF	GGUF
ComfyOrg版 bf16	z_image_bf16.safetensors
非公式蒸留モデル	GuangyuanSD/Z-Image-Distilled

LoRa

LoRa の学習はベースモデル（Tongyi-MAI/Z-Image）を使う。

ai-toolkit や musubi-tuner がサポートしている。

VRAM８GB で musubi-tuner でファインチューニングする方法は VRAM８GB で Z Image の LoRA作成を参照。

蒸留 LoRa

自作 LoRa 適用後にこの LoRa を適用すると低ステップ生成ができる。

DiffSynth-Studio/Z-Image-Turbo-DistillPatch

I2L

Turbo の LoRa 学習

ai-toolkit がサポートしている。ostris/zimage_turbo_training_adapter が必要。v2 は LoRa のランクと学習時間とを増やしたバージョンでこちらが推奨されている。

ai-toolkit は block swap を実装しているので低 VRAM（それでも 12 GB はあった方がいい）でも学習は可能。

ostris/zimage_turbo_training_adapter は何をしているのか？

差分学習によって概念だけを抽出している。

zimage_turbo_training_adapter は、ただの Z Image Turbo で生成された画像を学習させた LoRa だ。しかし普通に学習させたので蒸留能力はなくなっている。この LoRa を適用したモデルを使って LoRa を作成すると、作成した LoRa には概念だけが学習される。

この概念だけを抽出したものを Z Image Turbo に適用することで、蒸留能力を維持した状態で LoRa を適用できる。

ただし、Z Image Turbo は「蒸留と同時に強化学習＋蒸留後の RLHF」を実施しているので、追加学習の量が多いほど画質が劣化してしまう欠点がある。

クラウドでの学習

クラウドなら１回数百円（Runpod なら$10、vast.ai なら $5 が最低チャージ金額）で LoRa が作成できるので、LoRa を量産するのでない限り GPU を買い替える必要はない。クラウドは VRAM の量が多く、より高解像度の画像で学習できるので品質も良くなる。

クラウド＋ ai-toolkit での学習方法は以下の動画を参照。

de turbo

ostris/Z-Image-De-Turbo は Turbo で生成した画像で普通に学習させ蒸留を無くしたもの。Z Image Base が公開されるまではこれをベースに LoRa を作成できる。

学習データが偏っているので公式の Base より性能は低くなる。

実行時間

Turbo ではない Z-Image は CFG５、ステップ数 20 の場合、Turbo の５倍遅い。

共通設定

ステップ数：８（公式は９を推奨している）
サンプラー：euler
スケジューラー：simple
CFG：１

推論速度（s/it）のまとめ

SageAttention 単体で 15～25%、TorchCompile を追加すると 30% 前後高速化する。

解像度	最適化なし	SageAttention	SageAttention + TorchCompile
1,024 x 1,024	5.2	4	3.7
1,536 x 1,024	7.2	5.7	5.3
1,536 x 1,536	10.7	9.0	8.0
2,048 x 1,536	16.1	11.8	11.2

最適化なし

環境

Windows 11 24H2
ComfyUI 0.3.75
RTX3050 8GB
RAM 32GB
python 3.11.6
torch 2.8.0+cu126
モデル・テキストエンコーダーの量子化 bf16

実行時間

解像度	推論速度（s/it）	RAM 使用量のピーク（GB）
1,024 x 1,024	5.2	30
1,536 x 1,024	7.2	30
1,536 x 1,536	10.7	30
2,048 x 1,536	16.1	30

SageAttention

インストール方法は RTX3050+ComfyUI で SageAttention の導入を参照。

SageAttention 単体で 15～25% 推論速度が高速化する。

環境

Windows 11 24H2
ComfyUI v0.6.0-3-g532e2850 | Released on '2025-12-24'
RTX3050 8GB
RAM 32GB
python 3.12.9
torch 2.9.1+cu128
triton_windows-3.5.1.post23
sageattention-2.2.0+cu128torch2.9.0.post3
モデル・テキストエンコーダーの量子化 Q8_0

実行時間

解像度	推論速度（s/it）	RAM 使用量のピーク（GB）
1,024 x 1,024	4	24
1,536 x 1,024	5.7	24
1,536 x 1,536	9.0	24
2,048 x 1,536	11.8	24

SageAttention + TorchCompile

環境は SageAttention と同じで、モデルロード後に TorchCompileModel ノードを実行している。backend は inductor も cudagraphs も推論速度は変わらなかったので、inductor を使用。

SageAttention 比で５%前後の高速化。最適化なしとの比較では 30% 程度高速。

実行時間

解像度	推論速度（s/it）	RAM 使用量のピーク（GB）
1,024 x 1,024	3.7	24
1,536 x 1,024	5.3	24
1,536 x 1,536	8.0	24
2,048 x 1,536	11.2	24

高速生成

RTX4000 以降の GPU を使い、fp8_scaled のモデルを使う

SageAttention を導入する

NVFP4 量子化のモデルを使う。ComfyUI now supports (some) NVFP4 models
res_multistep サンプラーで４ステップで生成する

４ステップ生成は細部の甘さが目立つが、プロンプトの検証では役に立つ。上記の施策を適用した RTX 5090 は数秒で 2k の画像を生成する。

Controlnet

Z-Image-Turbo-Fun-Controlnet-Union-2.1-2601-8steps.safetensors を models/model_patches に配置。

ModelPatchLoader ノードで Controlnet をロードし、ZImageFunControlnet や QwenImageDiffsynthControlnet ノードで適用。Get Image Size ノードで Controlnet の画像と生成画像との解像度を一致させると便利。

ワークフロー

OpenposePreprocessor ノードや CannyEdgePreprocessor をインストールすると画像から Controlnet 用の画像を生成できる。

Controlnet の画質改善

Controlnet を強く効かせるには Strength を１にするしかないが、そうすると画質は悪化する。以下の対策がある。

KSamplerAdvanced を使い、最初の５ステップは Controlnet を使い、残りの４ステップは Controlnet を使わないようにする
Controlnet で４ステップ前後で生成し、Controlnet なしで i2i する

外部リンク

プロンプトガイド

否定形を使わない

描いてほしくないものはネガティブプロンプトに入力する。ただし推論時間が２倍になる。なお Turbo のような蒸留モデルはネガティブプロンプトと相性が悪い。

否定形を使う方法

ネガティブプロンプトを入力し、CFG を１以上にする

CFGNorm ノードや APG guidance ノードで高彩度を防ぐことができる

NAG を使う（NAG (Normalized Attention Guidance) works on Z-Image Turbo now.）
NegPiP を使う

あいまいな形容詞を避ける

✖： beautiful girl

〇：A girl with flowing silver hair that catches the moonlight

記述した方がいい項目

項目	例
位置関係	standing at the top of stairs, sitting under the tree
視線	looking down at viewer, gazing upward at the sky
感情	with a confident smile, mysterious expression
空気感	in a dreamlike atmosphere, surrounded by magical sparkles
質感	silk-like hair, crystalline dress
動き	hair swaying in the breeze, petals falling around her

構造化する

markdown で構造化するとプロンプトが無視される確率が下がる。

悪い例

She says "セリフ1" in a speech bubble in the left top.
She says "セリフ2" in a speech bubble in the left middle.
She says "セリフ3" in a speech bubble in the left bottom.

良い例

# speech bubble
There are three speech bubbles.
- speech bubble 1 (left top): "セリフ1"
- speech bubble 2 (left middle): "セリフ2"
- speech bubble 3 (left bottom): "セリフ3"

その他のプロンプト例

# person
There are three persons.
- person 1 (left): a girl, short red hair, blue eyes, sitting on a stool
- person 2 (middle): a boy, short brown hair, grey eyes, standing, v sign
- person 3 (right): a girl, long silver hair, red eyes, sitting on a sofa, holding a card with the word "test" written on it.

# person
There are five persons. From left to right
- person 1: a girl, short red hair, blue eyes, sitting on a stool
- person 2: a boy, short brown hair, grey eyes, standing, v sign
- person 3: a girl, long silver hair, red eyes, sitting on a sofa, holding a card with the word "test" written on it.
- person 4: a boy, green eyes, wearing a baseball cap, sitting
- person 5: a girl, brown eyes, long hair

# background
In the luxury interior.
a king size bed, a disheveled sheets, a disheveled satin dress, a lamp on a bedside table, a pillow and a large window with a view from a high-rise building in the background.

# background
## left
There is a bookshelf, many piled books and a potted plant.
## middle
There is a window, curtains and a bed.
## right
There is a book and a cup on a desk.

# style
Highly detailed illustration style blending anime aesthetics with painterly realism.

Edit

編集プロンプトには２種類ある。編集後のみをプロンプトに入れる方法と、差分を指定する方法だ。Z Image Edit はどちらも対応している。ピンポイントで画像を修正する場合は差分指定、参考画像として入力する場合は編集後のみを指定する方法を使う。

編集後のみの例： A girl with red eyes and white long hair is standing.
差分を指定する例：Change her eye color from blue to red, her hair length from short to long and her posture from sitting to standing.

プロンプトエンハンサーを使う

LLM を使って不足している情報を補わせる。ローカルでもクラウドでも可能。

ユーザーの入力したプロンプトを構造化して、*英語で*出力してください。情報が不足している場合は、アシスタントが自由に記述してください。
style of painting は以下の中からひとつ選んでください：photo realistic, anime, illustration, digital painting。
camera position は high angle, eye level, waist level, low angle から選択してください。
shot size は close up, medium shot, wide shot, full shot, long shot から選択してください。
*shot size で close up か medium shot かを選択した場合は、foot: の項目を削除してください*。
foot は barefoot か靴を履かせるかを選択してください。靴を履かせる場合はどのような靴かどのような色かを書いてください。
location は indoor, outdoor から選択してください。

出力は編集後の構造化されたプロンプトのみにしてください。英語で出力してください。

# ユーザーの入力したプロンプト

女の子が室内で立っている。

# output
# style of painting
- 

# camera
- camera position:
- shot size:

# person
- garment:
- foot: 
- pose:
- hair length:
- facial expression:

# background
- location: 
## left
-
## middle
- 
## right
-

# settings
- time of day: 
- season: 
- weather:
- lighting:

以下の例は古いプロンプトで生成しているので、不足している項目がある。

gpt-oss-20b の出力例

ユーザーの入力したプロンプトは「女の子がサイバーパンクシティのビルの屋上で立っていて、景色を眺めている。」。

プロンプトに「一般的ではないポーズとロケーションとシチュエーションを記述してください。」を追加している。

# style of painting
- anime

# camera
- camera position: low angle
- shot size: medium shot

# person
- garment: sleek, reflective cyberpunk jumpsuit with embedded LED strips forming abstract patterns; a translucent trench coat drapes over her shoulders.
- pose: standing on the rooftop railing, one hand resting on the metal rail, the other holding a small holo-map; head slightly tilted toward the horizon.
- hair length: long, flowing hair interwoven with faint neon strands that pulse softly in sync with city lights.
- facial expression: contemplative with a subtle, enigmatic smile, eyes reflecting distant neon signs.

# background
- location: outdoor
## left
- towering skyscrapers piercing a misty night sky, drones and floating advertisement billboards hovering above the rooftops.
## middle
- the girl herself, framed against the city skyline.
## right
- a massive holographic billboard projecting an abstract digital art piece that casts shifting colors onto her silhouette.

# settings
- time of day: night
- season: winter (frosted edges on the rooftop rail)
- weather: light mist and rain droplets suspended in the air, catching neon reflections.
- lighting: backlit by a myriad of neon signs; subtle rim lighting highlights her outline against the darkness.

Gemini 3 高速モードの出力例

ユーザーの入力したプロンプトは「女の子が都会の屋外で座っている。」。

# style of painting
- anime

# person
- garment: casual wear (jeans, hoodie)
- foot: shoes (white sneakers)
- pose: sitting on a bench, looking at her phone
- hair length: medium
- facial expression: calm, slightly focused

# background
- location: outdoor
## left
- skyscraper
## middle
- city park bench, street view
## right
- street lamp, small cafe in the distance

# settings
- time of day: late afternoon
- season: autumn
- lighting: soft, natural light, slight golden hour glow

Z Image Engineer

プロンプトエンハンサー用の、ユーザーが作成したモデル。データソースは Gemini。

BennyDaBall/qwen3-4b-Z-Image-Engineer

Z-Image Engineer - an LLM that specializes in z-image prompting. Anyone using this, any suggestions for prompting? Or other models to try out?

顔文字

Z-IMAGE-TRUBO-NEW-FEATURE DISCOVERED

a girl making this face "{o}.{o}" , anime
a girl making this face "X.X" , anime
a girl making eyes like this ♥.♥ , anime
a girl making this face exactly "(ಥ﹏ಥ)" , anime

認識するキャラリスト

英語では認識しないが中国語だと認識することもある（フリーレンや五条悟）。

ドラゴンボール
NARUTO
セーラームーン
原神

Furina (芙宁娜)

初音ミク
ピカチュウ
モンキー・D・ルフィ
ドラえもん
江戸川コナン
竈門炭治郎
アーニャ (spy x family)
フリーレン（芙莉莲）
五条悟
坂田銀時（银魂）
レム (Re:ゼロ)（雷姆）

スタイルリスト

ガチャ

Z-Image（Turbo ではないほう）はシードで絵が変わるので細工は不要。

ランダムな文字を入れる方法は少しランダム性を加える。最初の１ステップを dpmpp_2m_sde サンプラーで空のプロンプトでデノイズする方法は、ステップ数を減らすとランダム性が大きくなる。

「fdahoiuhroewa」のようなランダムな文字列をプロンプトに入れる
カスタムノードの SeedVarianceEnhancer を使う
Z-Image と Z-Image Turbo の２段サンプラーにする
最初の１ステップを dpmpp_2m_sde サンプラーで空のプロンプトでデノイズして、残りを通常のプロンプトでデノイズする
strength を 0.7 で t2i した後に、i2i する

Unlock diversity of Z-image-Turbo, comparison

以下のようなカスタムサンプラーを組む方法が紹介されている。ただし以下のワークフローは画質が悪いので、この後に通常のワークフローで i2i をする必要がある。

ワークフロー

上記のワークフローの ModelSamplingAuraFlow の数値を増やすとランダム性が増えるが画質が悪化する。BasicScheduler と SplitSigams とでステップ数を制御する。

このワークフローは

ModelSamplingAuraFlow でタイムステップをノイズが増える方向にずらす
ずらしたタイムステップのノイズが多い最初の５個を捨てる

これによりノイズを残しつつデノイズする。なので画質が悪い（残留ノイズがある）かわりに、ノイズのランダム性を取り込むことができる。

SplitSigmas は Automatic1111 の Prompt Editing を実装するのに使える。SplitSigmas は途中でモデルを変更可能かつ多段階分割可能なので、より柔軟な制御が可能。

解像度リスト

うまく描けないときは以下を試してみる。

解像度を 1,536 以下（学習時の解像度）で生成する

解像度をさげて描けるならアップスケールワークフローを使う

アスペクト比を変える

総ピクセル数	9:16	2:3	3:4	4:5	1:1
1MP	768 * 1344	832 * 1280	896 * 1152	896 * 1152	1024 * 1024
1.25MP	832 * 1536	960 * 1408	960 * 1344	1024 * 1280	1152 * 1152
1.5MP	960 * 1664	1024 * 1536	1088 * 1472	1152 * 1408	1280 * 1280
1.75MP	1024 * 1792	1088 * 1664	1152 * 1536	1216 * 1536	1344 * 1344
2MP	1088 * 1920	1152 * 1792	1280 * 1664	1280 * 1600	1472 * 1472
2.5MP	1216 * 2176	1344 * 1984	1408 * 1856	1472 * 1792	1600 * 1600
3MP	1344 * 2368	1472 * 2176	1536 * 2048	1600 * 1984	1792 * 1792
4MP	1536 * 2752	1664 * 2496	1792 * 2368	1856 * 2304	2048 * 2048
5MP	1728 * 3072	1856 * 2816	1984 * 2624	2048 * 2560	2304 * 2304
6MP	1856 * 3328	2048 * 3072	2176 * 2880	2240 * 2816	2496 * 2496
7MP	2048 * 3584	2240 * 3328	2368 * 3136	2432 * 3008	2688 * 2688
8MP	2176 * 3840	2368 * 3520	2496 * 3328	2560 * 3264	2880 * 2880
9MP	2304 * 4096	2496 * 3776	2688 * 3520	2752 * 3456	3072 * 3072
10MP	2432 * 4288	2624 * 3968	2816 * 3712	2880 * 3648	3264 * 3264

アップスケーラー

SeedVR2 が評判がいい。公式は 32bit でファイルサイズが大きいので fp8 の numz/SeedVR2_comfyUI などを使う。

アップスケーラーが良すぎて、なんか違法な気がする

サンプラーとスケジューラー

Perfect Z Image Settings: Ranking 14 Samplers & 10 Schedulers でサンプラーとスケジューラーの比較をしているがどれも変わらない。

ディティールを増やしたければ以下の方法がある

ステップ数を９以上に増やす
解像度を上げる
ModelSamplingAuraFlow のシフト量を増やす
Owen777/UltraFlux-v1 の VAE を使う

Z-Image Step CFG

公式の推奨設定はステップ数 28～50。

ステップ数と CFG の影響は以下のスレッドを参照。

CFG５、ステップ数 20 で十分という印象。Turbo が CFG１で８ステップなので、Turbo 比で５倍遅い。

作例

Z-Image の指定がない画像は Turbo 製。サンプラーは Euler、スケジューラーは simple。

作例は以下のリンクを参照。

Z Image Turbo

Qwen Image Edit 2509

There are three girls in a room.

The girl on the left has short red hair and blue eyes. She is sitting on a stool holding a card with the word "左" written on it.

The girl in the middle has long silver hair and red eyes. She is standing and holding a card with the word "中" written on it in both hands.

The girl on the right has medium brown hair and green eyes. She is sitting on a stool and holding a card with the word "右" written on it.

There are potted plants and a kitchen in the background.

Anime style.

プロンプトの指示をすべて満たし、画質もよく、手が溶けておらず、背景も破綻していない。軽量モデルの中ではダントツの基本性能を持っている。

以下のような、markdown + タグの羅列も機能する。

There are three girls in a room.

# person
There are three girls.
- left: short red hair, blue eyes, sitting on a stool, holding a card with the word "左" written on it.
- middle: long silver hair, red eyes, standing, holding a card with the word "中" written on it in both hands.
- right: medium brown hair, green eyes, sitting on a stool, holding a card with the word "右" written on it.

# background
potted plants, a kitchen.

# style
Anime style.

Z Image Turbo

Qwen Image Edit 2509

There is a piece of paper with "あいうえおアイウエオ
かきくけこカキクケコ
さしすせそサシスセソ
たちつてとタチツテト
なにぬねのナニヌネノ
はひふへほハヒフヘホ
まみむめもマミムメモ
やゆよヤユヨ
らりるれろラリルレロ
わをんワヲン" written on it.

中国語にある漢字は描けるが、ひらがな・片仮名が描けない。

Z-Image Turbo

Anime style.

# person

芙莉莲's upper body. 
- appearance: silver hair and green eyes
- facial expression: closing one eye with slight smile
- objects: She is holding a white board with handwritten "It works!"
- garment: a coat with a red jewelry on her neck button

# background

The background features a snowy night with bokeh.

Z-Image<br/>cfg 6, 30 steps, ModelSamplingAuraFlow 3, seed 1027744730587941

Z-Image
cfg 6, 30 steps, ModelSamplingAuraFlow 3, seed 1027744730587941

A digital painting of Furina(芙宁娜).

ネガティブプロンプト
3d, realistic

Z-Image
50 steps

Z-Image<br/>cfg 5, 30 steps, ModelSamplingAuraFlow 3, seed 466284077514030

Z-Image
cfg 5, 30 steps, ModelSamplingAuraFlow 3, seed 466284077514030

Japanese anime style illustration of Mutsuki from Blue Archive, full-body portrait, standing gracefully with long flowing grey-white hair and vibrant purple-pink eyes, gazing directly at viewer with a sweet, closed-mouth smile. Wears an elegant fusion garment combining elements of pink Hanbok and Kimono: wide, floor-length sleeves, layered flowing skirt, accented with satin pink ribbons and small artificial hair flowers nestled near her temple. Holds in both hands a large, lush bouquet composed entirely of fresh pink cherry blossoms with visible stamens and dewdrops.

Above her head, her signature circular halo emits a soft, constant glow with subtle radial gradient, casting no shadows. Background: dreamlike night scene under a massive, luminous full moon positioned upper-center, bathing entire scene in cool, diffused silver-blue moonlight interwoven with ambient pink-tinted atmospheric haze. Countless ethereal cherry blossom petals float mid-air in gentle, swirling trajectories; one small songbird with detailed feathers flies diagonally left-to-right near shoulder height; three butterflies flutter nearby — two with iridescent purple wings, one with bright yellow wings, all rendered with translucent wing membranes and fine vein patterns.

Top-right corner: the Chinese characters "春意盎然" rendered in traditional brush calligraphy style, pure white ink on semi-transparent background layer, stroke texture visible, characters aligned vertically, each approximately 1/8 the height of the total canvas, positioned 5% inward from top and right edges.

Color palette dominated by soft pinks, lavenders, and moonlit blues; fabric shows realistic silk sheen and fold dynamics; petals exhibit slight translucency and edge glow; ground surface implied by shallow depth-of-field blur but absent solid terrain — figure appears suspended in celestial air. Composition balanced with vertical emphasis, Mutsuki centered slightly below midpoint, moon and text forming diagonal counterweight. No lens flare, no bloom effects beyond natural glow sources.


ネガティブプロンプト

3d,realistic

# style

Pure illustration. The soft lighting.

# character

There are two girls in a room. One is standing, the other is sitting on a stool. 

## character 1

A long-haired girl with blue hair wearing a white dress and knee boots is standing facing forward on the left. She hides her hands behind her back. 

## character 2

On the right, a short girl with long red hair is sitting on a wood stool with her legs spread and leaning forward. 

Her arms are placed in front of her and her hands on the stool. She is wearing a black dress and knee boots.

She is facing front. Her height is short.

# background

In the background are a window, curtains, a table and a potted plant.

Z-Image<br/>cfg 5, 40 steps, ModelSamplingAuraFlow 3, seed 124135492970693

Z-Image
cfg 5, 40 steps, ModelSamplingAuraFlow 3, seed 124135492970693

hanagata,Fiction illustrations, akengihci, simple, A solo young girl with blue hair and blue eyes, wearing a white shirt with short sleeves and a black choker. She has earrings and makeup, with a short ponytail. Her closed mouth shows a subtle expression as she looks directly at the viewer. The simple white background highlights her elegant jewelry and clean style.

ネガティブプロンプト

3d,realistic

Z-Image<br/>cfg 5, 40 steps, ModelSamplingAuraFlow 3, seed 924414165643657

Z-Image
cfg 5, 40 steps, ModelSamplingAuraFlow 3, seed 924414165643657

A digital artwork of exquisite detail. A fantastical woman with long, silver-white hair and clear, beautiful eyes. The artwork emphasizes the textures of various materials: the softness of feathers on her headdress, the warm luster of pearls, and the sharp glint of crystal jewelry. Her skin is smooth and delicate, her lips moist. Strong contrasts of light and shadow highlight every detail, from individual strands of hair to the glitter in her eye makeup. The background is deep darkness, filled with shimmering magical particles.

ネガティブプロンプト

3d,realistic

Anime key visual. A group of girls playing beach volleyball. The main character is jumping for a spike, dynamic mid-air pose. She is wearing a revealing string bikini that defies physics. Sand flying, water splashing,  detailed anatomy.

# background

high contrast sunlight, sky, sea, valleyball pole, valleyball net

プロンプトの出典：https://www.reddit.com/r/StableDiffusion/comments/1p7a800/comment/nqwm0wx/

A hyper-detailed, maximalist anime illustration that looks like a stained-glass window made of neon lights. The art style is strictly flat, vector-like 2D with zero realistic textures. The scene features a mysterious anime girl floating in a void of floating geometric shapes, playing cards, and giant ticking clocks. The composition is a dizzying spiral that draws the eye to the center.

The character has rainbow-gradient hair that has no individual strands but acts as a solid shape with hard-edged highlights. Her eyes are incredibly detailed with multiple concentric rings of color. The color palette is an explosion of cyan, hot pink, and lime green, with absolutely no gray or brown tones. The shadows are colored (e.g., purple shadows on pink skin) rather than black. The background is packed with intricate patterns—checkerboards, polka dots, and Japanese traditional wave patterns—all fighting for attention but perfectly balanced in a "pop art" anime aesthetic.

プロンプトの出典：https://www.reddit.com/r/StableDiffusion/comments/1p8a54z/some_anime_style_images_i_made_with_zimage_turbo/

Z Image Turbo

Qwen Image Q3_K_M

The illustration of a chibi girl sitting in a chair eating a piece of pizza. She is saying "This is fine!" in a speech bubble. There is a table and a window in the indoor room.

左のロンググローブが描けていない

Anime-style fan art character illustration of a fairy warrior, a confident and magical female character with long flowing pink hair, soft gradients and dynamic strands, smiling with calm confidence and inner strength, bright expressive anime eyes, holding an elegant magical staff topped with a glowing crystal, wearing a detailed fantasy outfit that blends light armor with flowing fabric, subtle magical runes and ornaments, delicate fairy wings shimmering softly behind her, heroic yet graceful pose, vibrant fantasy color palette, dynamic lighting that highlights her face and staff, clean anime linework with high-detail shading, polished fan-art illustration style, magical atmosphere, pure illustration (not realistic), 16:9 aspect ratio

face detailer のプロンプト

Anime-style fan art character illustration of a fairy warrior, a confident and magical female character with long flowing pink hair, soft gradients and dynamic strands, smiling with calm confidence and inner strength, bright expressive anime eyes, vibrant fantasy color palette, dynamic lighting that highlights her face, clean anime linework with high-detail shading, polished fan-art illustration style, magical atmosphere, pure illustration (not realistic)

プロンプトの出典：https://www.reddit.com/r/StableDiffusion/comments/1qgcuso/flux2_klein_vs_flux2_vs_zimage/

anime style young woman, slender body, long white hair with bangs and an elaborate frilly headpiece, fair skin, large red eyes, wearing a black and white frilly gothic lolita dress with intricate ruffled details, posing with hands together near her face, adorned with a silver cross earring and a silver ring, against a plain light background, mood is solemn and elegant, soft lighting highlighting the delicate textures of the frills and hair, close-up shot focusing on the character's detailed features and accessories

日本語文字

A young girl with long, black hair and bangs, wearing delicate silver earrings and a thin necklace, looks calmly and slightly shyly at the camera.

Highly detailed digital pained illustration, amine style.

# ID card

She holds up an ID card with her right hand. The card is the same size as a credit card. The ID card has following information:
- her portrait that she wears a black top and has a serious expression
- name "極楽夢衣"
- date of birth "2010年04月23日"
- address "東京都千代田区永田町1丁目7−1"
- ID number "765-8766196"

# background

There is a red wooden furniture and white walls faintly visible in an indoors.

# style

Highly detailed digital pained illustration, amine style.

中国語文字

A young girl with long, black hair and bangs, wearing delicate silver earrings and a thin necklace, looks calmly and slightly shyly at the camera.

Highly detailed digital pained illustration, amine style.

# ID card

She holds up an ID card with her right hand. The card is the same size as a credit card. The ID card has following information:
- her portrait that she wears a black top and has a serious expression
- name "极乐梦衣"
- date of birth "2010年04月23日"
- address "东京都千代田区永田町1-7-1"
- ID number "765-8766196"

# background

There is a red wooden furniture and white walls faintly visible in an indoors.

# style

Highly detailed digital pained illustration, amine style.

A illustration of close-up profile of a young woman with ethereal, almost otherworldly qualities. The subject has pale, flawless skin and striking yellow eyes that are slightly downcast, giving her an introspective appearance. Her long, wavy hair is a silver-gray color, flowing dynamically and appearing almost as if its caught in a gentle breeze, adding a sense of movement to the image. She has full, red lips that contrast sharply with her pale complexion. She is wearing a white garment with a red collar, which is slightly visible at the bottom of the frame.   

# background

The background is a dark, gradient gray, which helps to highlight the subjects luminous features and hair.

# lighting

The lighting is soft and diffused, casting gentle shadows that enhance the three-dimensionality of her face and hair. The overall mood of the image is serene and mystical, with a focus on the delicate and ethereal qualities of the subject.

# style

This is a digital painting in a realistic, semi-abstract style. Use smooth, blended brushstrokes to create a sense of realism while maintaining a slightly fantastical atmosphere.

プロンプトの出典は Flux 2 K 9b 25.01.26

A gritty fantasy illustration of a 

---

An redhead girl with arrogant face expression, heavy makeup, smokey eyes, from side, tilted head, bright eyes, UwU, black choker, long silver earrings


# tags

Concept art, rough texture, hatching, colorful, made with soft pastel inks,

Anime style.

An Anime girl with realistic textures reclines in a cozy, tech-infused workspace bathed in a cool, ambient blue light. She has silky grey hair styled into a high ponytail secured with a black bow, with sleek, reflective black sunglasses resting atop her head—shimmering with the glow of the screen behind her. Her outfit blends casual and edgy vibes: a white crop top with bold blue "RIDE" lettering across the chest, layered under a black zip-up jacket featuring plush blue fur trim on the collar and metallic buttons that catch the light. Fingerless black gloves encase her hands, one holding a white coffee mug emblazoned with the MIT logo—steam delicately curling upward from the rim, hinting at warmth. Her lower body showcases tight black legwear: high-waisted shorts with adjustable straps that cinch her thighs and waist, emphasizing her relaxed pose as she lounges with legs playfully bent. Her left hand rests near her cheek, fingers softly grazing her skin, eyes half-lidded in a serene, contemplative expression that conveys calm focus. The background reveals a modern workspace: a monitor displays an anime-style character with frosty, futuristic details, a sleek keyboard and mouse sit beside it, and wireless speakers with glowing blue LED accents add to the digital ambiance. The soft blue lighting casts gentle shadows across smooth skin, silky hair, and textured fabric, enhancing the cozy yet stylish atmosphere of a personal creative sanctuary. Every detail—from the subtle sheen of the mug to the precision of the jacket’s stitching—blends anime’s vibrant storytelling with realistic rendering, creating an immersive scene where comfort and tech culture harmonize.

Anime style.

Z-Image Turbo

Z-Image<br/>CFG 5, 30 steps, seed 1102961349787997<br/>毎回絵柄と内容が変わる

Z-Image
CFG 5, 30 steps, seed 1102961349787997
毎回絵柄と内容が変わる

Against a light blue sky, a large white long-haired cat (with gradient deep blue eyes and a light blue collar) leans against a girl's profile, both looking up in unison. The girl has slightly curly short hair, cherry-shaped earrings, and wears light blue and white clothing. The scene is filled with pink, yellow, and blue petals and heart-shaped light spots, the colors are clear and bright, and the overall atmosphere is healing and dreamy, full of the fresh feeling of spring. (Pastel illustration style)

A split-screen composite portrait of a full body view of a single Chinese woman with opening mouth, front view. The image is divided vertically down the exact center of her face. The left half is a black and white rough anime pencil sketch style, the right half is digital painting. The facial features align perfectly across the center line to form one continuous body. Seamless transition.

Z-Image: A bit of prompt engineering (prompt included)

The game screen capture style.

The Genshin Impact character, Hu Tao(胡桃), is depicted in a lively pose on the right side of the image. She wears a black outfit with red and white accents on the sleeves and skirt lining. Her long, dark brown hair flows in the wind and she wears a distinctive flower-adorned hat. Her right hand is near her face, and in her left she holds a glowing red polearm. A distinctive white ghost floats at her feet.

The background is a rich autumn landscape, with shrubs and trees colored in orange and red, creating depth. The ground is covered with mossy rocks and lush vegetation, and is decorated with a small lantern and plants with pink flowers. A warm light shines throughout, highlighting the character.

This illustration combines pen and ink drawing with watercolor techniques, featuring highly saturated colors.

A girl sits on a chair, smiling as she holds chopsticks.

A vibrant, an abundant tuna and salmon sushi and sashimi platter neatly arranged within a simple, light brown cardboard takeaway box. The subtle texture of the box's matte surface provides a minimalist backdrop, allowing the vibrant colors of the sushi to truly pop. The left side features a meticulously arranged, full tuna and salmon sashimi rose with delicate, orange-pink layers, accompanied by two simple, thick slices of salmon. To the right, a delightful array of individual salmon-wrapped rice balls are generously topped with swirls of creamy white cream cheese and garnished with bright green, finely chopped scallions or chives. Below these, two organized rows of inside-out salmon and cream cheese sushi rolls (uramaki) showcase their fluffy white rice exteriors, heavily speckled with contrasting black sesame seeds, revealing the rich salmon filling in their cleanly cut cross-sections. The scene is illuminated by soft, diffused natural light, creating gentle highlights on the creamy fillings and the glossy salmon, enhancing the appetizing appeal without harsh shadows. The edges of the box are slightly visible, grounding the scene and emphasizing its 'to-go' nature.

The box is placed on a table. There is a soy source in a saucer on the left side of the box.

# Core Feature
- Character Type: A cute female figure in a chibi (Q-style) aesthetic.
- Facial Features: Large red eyes, a small nose and mouth, long, wavy golden hair, and bangs.
- Headwear: A large bow on her head, made of white, gray, and dark blue striped fabric. A blue cross and two red lines are affixed to her forehead.
- Pose: Left hand waving, right hand making a "V" sign.

# Clothing Details
- Overall: Wears a white, multi-layered ruffled dress.
- Neckline and Cuffs: Delicate white lace trim on the neckline and cuffs.
- Chest Accessory: A blue and white striped bow tied at the chest, with a red round gemstone in the center.
- Pendant: Two dark blue ribbons hang from the bow, each ending in a gold cross pendant.
- Other: Gold buttons adorn the dress.

# Background Elements
There is a charactere in the blurry background.
- Environment: Another character is faintly visible in the background. This character has golden hair and black demon/bat-wing-like horns, and wears black clothing.

# Style and Texture
- Style: Japanese anime style, emphasizing cuteness and refinement.
- Materials: The texture of the clothing fabric, the delicacy of the lace, and the luster of the metallic pendants all need to be depicted.

Atmospheric anime illustration of a beautiful young woman with long flowing black hair standing on an old school staircase, looking back over her shoulder directly at the viewer with an enigmatic expression. Character wears a classic school uniform consisting of a white blouse with navy blue tie, dark pleated skirt, black knee-high socks and leather shoes. Dramatic cinematic lighting from above creates strong contrast between illuminated figure and shadowy staircase surroundings, casting sharp shadows across the tile walls. Environment features worn ceramic tiles in checkerboard pattern on walls and steps, wrought iron handrail with peeling paint, and a vintage directional sign mounted on the wall showing Japanese characters for "出口".  Thick-coated 3D texture rendering with visible brushstrokes creates tangible surface quality on tiles, fabric and metal. Cool color palette dominated by deep blues and grays with strategic highlights in warm tones on skin and hair. Highly detailed illustration style blending anime aesthetics with painterly realism, emphasizing the visual impact of light and shadow interplay. Mysterious, contemplative mood with slight gothic undertones, professional digital painting with exceptional material rendering.

Z Image Turbo

Qwen Image Edit 2509

A girl holding a M4 assault rifle in her right hand, is stainding in the apocalypse city. She is looking at side. She is wearing a hooded jacket, a skirt and boots with a backpack.

# background

The background is a dark, gradient snowing gray.

Draw following objects:
- disfigured road
- disfigured buildings
- many broken cars, tanks
- debris and rubble
- a broken traffic light

# style

This is a digital painting in an anime but semi-realistic style. 

Dramatic cinematic lighting and mysterious, contemplative mood.

Draw the M4 assault rifle in detail.

Z Image Turbo は手と銃が崩れやすく 100 回程度ガチャをした。こういうのは Nano Banana Pro や Edit モデルで修正した方が早い。

Qwen Image Edit は手が崩れにくかった。プロンプトの忠実性が高いのは Qwen Image Edit だが、画質が良くディティールが豊富なのは Z Image Turbo。

# composition

## right
A girl with long flowing black hair standing on the right side of the table. The girl wears a classic school uniform consisting of a white blouse with navy blue tie, dark pleated skirt, black knee-high socks.

## middle
There is a round mini table on the center of the image. There is a empty coffee cup on the table. 

## left
There is a bookshelf and piled books in the one.

# background

In the living room, there are a window, curtains, bookshelfs, piled books, a table, and a potted plant.

# style

Atmospheric anime illustration. The soft lighting creates a humorous atmosphere.

「画面の右側に人がいる」のようなプロンプトは無視され、画面中央に人が配置される。構図を具体的に指定する必要がある。

Z Image Turbo

A young woman is standing on a subway platform.

She is short black hair, wearing a light gray cap, a light colored cropped top and short black pants.

She is holding an iPhone in her left hand.

# background

There are signs, Braille blocks and blurred figures, with a sense of urban transit in the subway station. There is a sign written "出口" on it.

# style

japanese manga, line art, pen art, black and white, halftone

物を持たせると手が怪しい。点字ブロックは知っているが描くのはうまくはない。

あとキャラの位置が高い。キャラの頭の位置を、背景の人の頭の位置に合わせた方がいい。

スタイル

作例は I continue to be impressed with Z-Image-Turbo's style flexibility.。プロンプトは https://pastebin.com/ZbmMjrzF

Z Image Turbo

Girl with long wavy hair wearing a pink tank top and black shorts in hand-drawn manga art style, casually leaning against a stone wall and holding a cigarette, slight smirk, black and white manga illustration, city skyline with glowing lights and a vibrant sunset in the background, heavy screentone shading on the wall, clothing, and smoke, strong rim lighting from the sunset illuminating her side profile, medium long shot composition with the city slightly blurred in the background, pen and ink lineart, rebellious and dramatic atmosphere, traditional manga panel aesthetic with speed lines emanating from the cigarette smoke

Z Image Turbo

長いプロンプトを見る

**CHARACTER DESIGN: “THE TWO GIRLS” — STYLIZED FOR ANIME AESTHETIC**

—

**CHARACTER 1 (LEFT — CAMERA GIRL)**
*Stance:* Upright, centered in frame. Dynamic pose: right arm raised holding a sleek black camera at eye level; left hand on hip, thumb visible. Slightly tilted posture to suggest focus and intent.

*Facial Design:*
- Smooth, light-medium skin tone with flat cel shading — no pores or texture.
- Large, expressive eyes in light brown/hazel with thick black eyelashes — simplified but emotive.
- Small, neat nose and full lips drawn with clean lines — neutral expression rendered through minimal line work.
- Hair: short, dark brown blunt cut with crisp edges — frames face symmetrically using thick black outlines. No strands or flyaways.

*Body & Attire:*
- Simplified torso with exaggerated waistline — no anatomical weight, just stylized curves.
- Bikini bottoms: bold orange and white stripes with thin black waistband — flat color blocks, no fabric texture.
- Lower abdomen exposed as clean negative space — no pubic hair detail, just smooth contour lines.

*Pose Emphasis:*
- Arm gesture exaggerated for visual energy — camera held high, fingers curled slightly into stylized “holding” shape.
- Hip hand creates balance — thumb shown as a small, clean oval with a single line for definition.

—

**CHARACTER 2 (RIGHT — LEANING GIRL)**
*Stance:* Leaning against wall, angled toward left. Head tilted slightly — playful smirk implied through a small curve in the mouth and one eyebrow raised.

*Facial Design:*
- Rounded face with smooth, matte skin tone — flat cel shading.
- Dark brown eyes looking directly forward — thick lashes and almond-shaped pupils with minimal detail.
- Full lips drawn in a confident, curved line — no depth or texture.
- Hair: dark brown pulled back into messy bun with patterned black headscarf — thick outline around the scarf, minimal strands inside.

*Body & Attire:*
- Bikini bottoms: solid black with thin waistband — smooth, stretchy silhouette stylized as a single curved shape.
- Lower abdomen exposed with clean contour lines — no biological detail, just simplified silhouette.

*Pose Emphasis:*
- Left arm relaxed against wall — fingers loosely drawn as simple arcs.
- Right hand on abdomen — stylized palm with clean lines, no texture or realism.

—

**STYLIZATION RULES APPLIED:**

—
**ANTI-REALISM FILTER (FULLY ENFORCED):**
• All references to pores, skin texture, sweat, veins — removed. Replaced with smooth cel-shading and hard-edge shadows.
• Fabric texture, creasing — replaced with flat color blocks and simplified folds.
• Pubic hair — removed entirely; lower abdomen rendered as clean negative space with stylized contour lines.
• Lighting: flat, even illumination — no subsurface scattering or complex shading. Shadows are hard-edged and minimal.

—

**STYLISTIC DEFINITION (HAND-DRAWN ANIME CEL):**
• Thick, bold black ink outlines define all character contours — crisp edges with no soft blending.
• Interior details: thin, clean linework for facial features, clothing seams, and accessories — no shading gradients.
• Colors: flat, saturated hues with minimal variation — orange stripe contrast against skin tone and black bikini.
• Background: simplified white walls with minimal detail — door handle as a thin, geometric shape.

—

**SETTING & ATMOSPHERE (STYLIZED):**

—
*Location:* Bathroom hallway — walls and door rendered as flat, smooth surfaces with thick outlines.
• Walls: plain white — no patterns or textures. Door handle visible as a simple black-and-white shape on the left edge of frame.
• Floor: not shown — only implied by negative space and character positioning.

*Lighting:* Bright, even — no shadows cast on walls or floor. All elements lit with flat illumination to enhance cel-shading.

—

**COMPOSITION & TECHNICALS:**

—
*Camera Angle:* Eye-level, slightly above head — captures torsos and upper legs in tight medium shot.
• Framing: minimal background, girls occupy most of the frame — negative space balanced on either side.
• Color Palette: Neutral base (white, beige) with accent colors — orange stripes and black bikini.
• Style: “Hand-drawn anime cel” with exaggerated forms, simplified anatomy, and bold outlines.

—

**FINAL NOTES:**

This design sheet removes all realism while preserving the emotional tone, pose dynamics, and color contrast. The characters are rendered as stylized anime figures — expressive, clean-lined, and visually dynamic without biological detail. The environment supports the characters with minimalism and flat shading — perfect for animation production.

—

**END OF STYLE SHEET**

Z Image Turbo

This image features a striking, stylized anime illustration of a cute girl sitting by the sea.

# Character & Pose
- Appearance: The girl has short, messy black hair that appears windblown. She has pale skin and bright blue eyes, looking upward with a calm in profile, contemplative expression. He is blowing out cigarette smoke softly.
- Pose: She is seated on a dark, rocky outcrop at the edge of the water. Her right hand rests on the rock for support, while her left hand is raised, holding a lit cigarette near her face. Her legs are bent, with her right knee pulled slightly toward her chest.

# Attire & Accessories
- Swimwear: She is wearing a simple black string bikini. The top features a prominent white skull and crossbones graphic on the right cup.
- Footwear: In a stylistic contrast to her swimwear, she is wearing black high-top sneakers (resembling Chuck Taylor All-Stars) with white laces and white rubber toes.
- Item: She is holding a cigarette with a thin trail of white smoke curling upward into the sky.

# Environment
- Background: The setting is a minimalist coastal scene. The blue sea meets a light, pale blue sky at a sharp horizon line. Small white ripples suggest gentle waves hitting the shore.

# Style
- very pale color
- Skin tones: white skin, with subtle peach shading
- hand-drawn manga art style
- pen and ink lineart

体操プロンプト

１回で成功することはまれで数回やり直す必要がある。5. Standing Split With Forward Fold はプロンプトを修正し 10 回以上生成した。

プロンプトの出典は：https://www.reddit.com/r/StableDiffusion/comments/1qhv0g1/flux_klein_gives_me_sd3_vibes/

1. Revolved Half Moon Balance

Z Image Turbo

A woman standing on one leg, full body visible, the supporting foot planted firmly on the ground while the opposite leg extends straight backward at hip height. Her torso twists sideways toward the camera, one hand reaching down to touch the floor while the other arm stretches vertically upward. Spine visibly rotated, shoulders stacked unevenly, hips misaligned by design. Tight athletic clothing clearly showing leg separation, knee alignment, ankle angle, and the twist of the waist. Camera at waist height, slight three-quarter angle, clean studio lighting revealing exact limb positioning.

2. One-Legged Crow Transition

Z Image Turbo

A woman balanced low to the ground in a yoga arm balance. Both hands planted flat on the floor, elbows bent at sharp angles, shoulders leaning forward. One knee rests against the upper arm while the opposite leg extends backward fully off the ground. Head slightly lifted, neck extended forward. Weight distribution clearly visible through shoulder compression and wrist angle. Full body in frame from a low side angle, emphasizing arm strain, bent joints, and asymmetry between legs.

3. Deep Backbend Dropback

Z Image Turbo

A woman standing upright mid-transition into a deep backbend. Knees slightly bent, hips pushed forward, spine arched dramatically backward. Head tilted fully behind her with face upside down relative to torso. Arms reaching behind toward the floor but not yet touching. Rib cage lifted, abdomen stretched, pelvis visibly angled forward. Shot from the side at chest height, strong directional lighting highlighting spinal curvature and torso deformation under tension.

4. Twisted Seated Bind

Z Image Turbo

A woman seated on the ground with one leg folded under her and the other bent across her body. Torso twisted sharply in the opposite direction of the legs. One arm wraps behind her back while the other reaches around the front to clasp the wrist, forming a closed bind behind her torso. Shoulders uneven, spine corkscrewed. Camera positioned slightly above, looking down to emphasize overlapping limbs and hidden joints. Clear visibility of hand placement, elbow direction, and torso rotation.

5. Standing Split With Forward Fold

Z Image Turbo

A woman bending forward but her left leg is straight upward. She is wearing a black highleg leotard.

# background and the camera

Camera directly from the side to expose hip misalignment, leg separation, knee locking, and foot orientation. Neutral background, sharp lighting, no motion blur.

6. King Pigeon

プロンプトの出典は Z-Image Base test images so you don't have to。

Z Image Turbo

A high-detail, full-body shot of a professional yoga instructor performing a complex "King Pigeon" pose on a wooden deck at sunrise. She is sitting with her spine deeply arched, one arm reaching back to grasp her upturned foot, and the other hand resting on her knee. She is wearing tight, charcoal-gray ribbed leggings and a sports bra, revealing the natural musculature of her core and shoulders. The morning sun creates a rim light along her body, highlighting the skin texture and muscle definition. 

# style

Photorealistic, perfect anatomy, balanced proportions.

Shot on Sony A7R V, 50mm f/1.2 GM lens, aperture f/2.0, 1/500s shutter, ISO 100. Crisp focus on the subject with a soft, sun-drenched coastal background.

7. Standing Split

Z Image Turbo

Photograph of a beautiful slender Chinese woman with long black hair performing a high split against a dark wooden door in a minimalist room. She is wearing a light purple long-sleeve top and grey leggings with barefoot. Her right leg is raised vertically, touching the doorframe above her head. Her hands are pressed against the very wide door. To the left, a vase with delicate branches extends into the frame. The room has white walls and a tiled floor. The lighting is soft, highlighting her flexibility and the textures of her clothing and the door.

外部リンク

advanced prompt adherence: Z image(s) v. Flux(es) v. Qwen(s)

Z Image Turbo を ComfyUI で実行する＋プロンプトガイド

LoRa 作成用モデル

生成用モデル

Z-Image VS Turbo

目次

特徴

欠点

必要スペック

画像の VRAM 使用量

モデル

Z-Image Turbo

そのほかのモデル

Z-Image

LoRa

蒸留 LoRa

I2L

Turbo の LoRa 学習

ostris/zimage_turbo_training_adapter は何をしているのか？

クラウドでの学習

de turbo

実行時間

共通設定

推論速度（s/it）のまとめ

最適化なし

環境

実行時間

SageAttention

環境

実行時間

SageAttention + TorchCompile

実行時間

高速生成

Controlnet

Controlnet の画質改善

外部リンク

プロンプトガイド

否定形を使わない

あいまいな形容詞を避ける

記述した方がいい項目

構造化する

悪い例

良い例

Edit

プロンプトエンハンサーを使う

Z Image Engineer

顔文字

認識するキャラリスト

スタイルリスト

ガチャ

Unlock diversity of Z-image-Turbo, comparison

解像度リスト

アップスケーラー

サンプラーとスケジューラー

Z-Image Step CFG

作例

スタイル

体操プロンプト

1. Revolved Half Moon Balance

2. One-Legged Crow Transition

3. Deep Backbend Dropback

4. Twisted Seated Bind

5. Standing Split With Forward Fold

6. King Pigeon

7. Standing Split

外部リンク