VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

This project is experimental; please leave your feedback in issues or contact us. Email: [email protected]

We’d like to note that after finishing our project and submitting the paper, we came across NegPiP. It’s a great community project that uses an approach very similar to ours. Although our work was developed entirely independently, we want to acknowledge their work and point readers toward it as a similar resource.

Web Page

Preprint

Web Demo for Wan 2.1 VSF

Wan 2.1 web demo https://huggingface.co/spaces/weathon/VSF

Introduction

This project introduces a new method called Value Sign Flip (VSF) that improves how image generation models handle negative prompts.

Problem: Modern few-step text-to-image models often struggle to properly exclude concepts described in negative prompts. Existing methods (CFG) either don’t work well or require heavy changes to the model (NegationCLIP).

Solution (VSF): We propose a lightweight technique that flips the value vector of negative prompt embeddings during attention. This cancels out unwanted features without retraining or needing access to classifier-free ⚡️.

Key Advantages:

⚡ Works with few-step and even single-step generation models (currently only supports SD3.5, Flux, and Wan), able to generate video with negative guidance in 30s. (480p, Wan 1.3B, 81 frames)
🔧 Requires no model retraining.
🚫 Avoids common issues like negative prompts being accidentally reinforcing the undesired concept.
🎯 Includes attention masking and token duplication to isolate effects to only where needed.

ComfyUI

ComfyUI custom node is available at comfyui, make sure you have the diffusers installed in your Comfy envirement

News

🎉 Jan 27, 2026: Our paper is accepted at ICLR 2026! See you in Brazil!
📽️ Dec 6, 2025: Presented at NeurIPS GenProCC Workshop
🤗 Dec 5, 2025: HuggingFace Demo for VSF SD3.5 with comparison with NAG
📄 July 26, 2025: Preprint uploaded
🎇 July 26, 2025: First version of ComfyUI node added
🤗 July 19, 2025: HuggingFace Space demo for Wan added
📼 July 17, 2025: We now had experimental support for Wan 2.1
🖼️ July 16, 2025: We now support Flux Dev and Flux Schnell
🎨 July 15, 2025: We open sourced our repo and has support for SD3.5-large-turbo

Examples

SD3.5

This is an SD3.5 example; the green prompt is the positive prompt, and the red text is the negative prompt.

Flux

Positive Prompt: `a chef cat making a cake in the kitchen, the kitchen is modern and well-lit, the text on cake is saying 'I LOVE AI, the whole image is in oil paint style'`

Negative Prompt: chef hat

Scale: 3.5

Positive Prompt: `a chef cat making a cake in the kitchen, the kitchen is modern and well-lit, the text on cake is saying 'I LOVE AI, the whole image is in oil paint style'`

Negative Prompt: icing

Scale: 4

This video shows a positive prompt of a canadian winter landscape in the style of a 19th century painting and negative prompt of snow at different scale, from 1 to 8.9 (Code). We can see as the scale increase the snow is decreasing.

Wan 2.1

Please checkout our examples in https://vsf.weasoft.com/.

Usage

You can clone this repo into your working folder, and execute the following code. We subjectively find that SD3.5 version is better at following negative prompt while Flux Schnell version has better quality. It seems like our method did not work well on Flux Dev.

**Note: the CFG scale has to be set to 0 to use our method. **

Wan Web Demo

Clone the repo, and run python3 app.py will start a gradio interface for Wan.

SD3.5-large-turbo

import torch
from src.sd3_pipeline import VSFStableDiffusion3Pipeline
pipe = VSFStableDiffusion3Pipeline.from_pretrained(
    "stabilityai/stable-diffusion-3.5-large-turbo",
    torch_dtype=torch.bfloat16,
).to("cuda")
prompt = "A poker table is set in the casino room, green felt stretched tight over the oval surface."
negative_prompt = "cards"
image_ours = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    guidance_scale=0.0, # This has to be 0
    num_inference_steps=8,
    scale=3.5,
    offset=0.1
    generator=torch.Generator("cpu").manual_seed(19)
).images[0].save("demo.png")

A demo notebook and comparsion with NAG can be found in demo.ipynb.

Flux Schnell

import torch
from src.flux_pipeline import VSFFluxPipeline
import numpy as np
import imageio

pipe = VSFFluxPipeline.from_pretrained("black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16).to("cuda")

prompt = "a canadian winter landscape in the style of a 19th century painting"
image = pipe(
    prompt,
    negative_prompt="snow on the ground",
    guidance_scale=0.0,
    num_inference_steps=8,
    max_sequence_length=256,
    scale=6,
    generator=torch.Generator("cpu").manual_seed(19)
).images[0].save("demo.png")

Flux Dev

~~(Our method doesn't seem to work on Flux Dev)~~ Flux version is in the fix_flux branch

import torch
from src.flux_pipeline import VSFFluxPipeline
import numpy as np
import imageio

pipe = VSFFluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16).to("cuda")

prompt = "a bike on a snowy road in the style of a 19th century painting"
image = pipe(
    prompt,
    negative_prompt="wheels",
    guidance_scale=0.0,
    num_inference_steps=32,
    max_sequence_length=256,
    scale=8,
    generator=torch.Generator("cpu").manual_seed(19)
).images[0].save("demo.png")

Wan2.1

Wan 2.1 does not have a complete pipeline yet, so the code is a bit long

import torch
from diffusers import AutoencoderKLWan
from vsfwan.pipeline import WanPipeline
from vsfwan.processor import WanAttnProcessor2_0
from diffusers.utils import export_to_video

model_id = "Wan-AI/Wan2.1-T2V-1.3B-Diffusers"
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
pipe.load_lora_weights(
    "Kijai/WanVideo_comfy",
    weight_name="Wan21_CausVid_bidirect2_T2V_1_3B_lora_rank32.safetensors",
    adapter_name="lora"
) 
pipe = pipe.to("cuda")

# prompt = "A chef cat and a dog baking a cake together in a kitchen. The cat is carefully measuring flour, while the dog is stirring the batter with a wooden spoon. The cat is wearing a chef suit"
# neg_prompt = "chef hat"
prompt = "A cessna flying over a snowy mountain landscape, with a clear blue sky and fluffy white clouds. The plane is flying at a low altitude, casting a shadow on the snow-covered ground below. The mountains are rugged and steep, with patches of evergreen trees visible in the foreground."
neg_prompt = "trees"

neg_prompt_embeds, _ = pipe.encode_prompt(
    prompt=neg_prompt,
    padding=False,
    do_classifier_free_guidance=False,
)

pos_prompt_embeds, _ = pipe.encode_prompt( 
    prompt=prompt,
    do_classifier_free_guidance=False, 
    max_sequence_length=512 - neg_prompt_embeds.shape[1],
)
pipe.set_adapters("lora", 0.5)



neg_len = neg_prompt_embeds.shape[1]
pos_len = pos_prompt_embeds.shape[1]
print(neg_len, pos_len)
height = 480
width = 832
frames = 81

img_len = (height//8) * (width//8) * 3 * (frames // 4 + 1) // 12
print(img_len)
mask = torch.zeros((1, img_len, pos_len+neg_len)).cuda()
mask[:, :, -neg_len:] = -0.2 # this should be negative

for block in pipe.transformer.blocks:
    block.attn2.processor = WanAttnProcessor2_0(scale=1.7, neg_prompt_length=neg_len, attn_mask=mask)

prompt_embeds = torch.cat([pos_prompt_embeds, neg_prompt_embeds], dim=1)

output = pipe(
    prompt_embeds=prompt_embeds,
    negative_prompt=neg_prompt,
    height=height,
    width=width,
    num_frames=frames + 1,
    num_inference_steps=12,
    guidance_scale=0.0, 
    generator=torch.Generator(device="cuda").manual_seed(42),
).frames[0]
export_to_video(output, "vsf.mp4", fps=15)

To-do List

This to-do list will be listed in issues. If it is not assigned yet, feel free to assign it to yourself and contribute

Name		Name	Last commit message	Last commit date
Latest commit History 479 Commits
comfyui/custom_nodes/value_sign_flip		comfyui/custom_nodes/value_sign_flip
experiments		experiments
figures		figures
media		media
nasa		nasa
prompts		prompts
qwen_image		qwen_image
scripts		scripts
src		src
videos		videos
vsfwan		vsfwan
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
demo.ipynb		demo.ipynb
experimentstest_nag_original.py		experimentstest_nag_original.py
flux-schnell.png		flux-schnell.png
flux_demo.py		flux_demo.py
original.mp4		original.mp4
qwen.ipynb		qwen.ipynb
qwen.py		qwen.py
requirements.txt		requirements.txt
vsf.mp4		vsf.mp4
wan.md		wan.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

Preprint

Web Demo for Wan 2.1 VSF

Introduction

ComfyUI

News

Examples

SD3.5

Flux

Wan 2.1

Usage

Wan Web Demo

SD3.5-large-turbo

Flux Schnell

Flux Dev

Wan2.1

To-do List

Star History

About

Uh oh!

Releases

Packages

Languages

License

weathon/VSF

Folders and files

Latest commit

History

Repository files navigation

VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

Preprint

Web Demo for Wan 2.1 VSF

Introduction

ComfyUI

News

Examples

SD3.5

Flux

Wan 2.1

Usage

Wan Web Demo

SD3.5-large-turbo

Flux Schnell

Flux Dev

Wan2.1

To-do List

Star History

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages