QWEN-Anime

🎨 QWEN-Anime-Beta1 and Beta2 | Specialized Anime Checkpoint

⚠️ Note: Unfortunately, I had to use some tricks to fit all versions on one model card. In the lists below, the name on the left is the actual file format and the name on the right is the label shown for that download.

🧠 SafeTensor Versions

🟪 BF16 = Pruned Model bf16 (38.05 GB)
🟦 FP16 = Pruned Model fp16 (38.05 GB)
🟨 FP8 = Pruned Model fp8 (19.03 GB)

💾 GGUF Versions

⚠️ Requires ComfyUI-GGUF: https://github.com/city96/ComfyUI-GGUF

🔹 F16 = Full Model fp16 (38.07 GB)
🔹 Q8 = Full Model fp8 (20.23 GB)
🔹 Q6_K = Full Model bf16 (15.63 GB)
🔹 Q4_K_S = Full Model nf4 (10.72 GB)

💡 Quick Guide:

  • Safetensors (🟪🟦🟨) → Use the regular "Load Diffusion Model" node

  • GGUF (🔹) → Needs ComfyUI-GGUF → UnetLoaderGGUF node + models/unet/ folder
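
The Quick Guide above can be sketched as a small lookup. This is only an illustration: the helper name `where_to_put` is hypothetical, and the folder and node names are the ones given on this card (diffusion_models/ with "Load Diffusion Model", unet/ with UnetLoaderGGUF).

```python
from pathlib import PurePosixPath

# File extension -> (ComfyUI models subfolder, loader node), as described on this card.
LOADERS = {
    ".safetensors": ("models/diffusion_models", "Load Diffusion Model"),
    ".gguf": ("models/unet", "UnetLoaderGGUF"),
}


def where_to_put(filename: str) -> tuple:
    """Return (subfolder, loader node) for a downloaded model file (hypothetical helper)."""
    suffix = PurePosixPath(filename).suffix.lower()
    if suffix not in LOADERS:
        raise ValueError(f"unsupported model file: {filename}")
    return LOADERS[suffix]


for name in ("qwen-anime-beta2-fp8.safetensors", "qwen-anime-beta2-Q8.gguf"):
    folder, node = where_to_put(name)
    print(f"{name}: put in {folder}/, load with {node}")
```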

"Velvet-Horizon"


Beta2 Release - Custom-trained anime checkpoint based on qwen-image-edit 2509

This is the Beta2 release of my custom-trained anime checkpoint, built on qwen-image-edit 2509. After many more hours of training on dual Nvidia Tesla P40 GPUs, with an expanded custom dataset and a later training epoch, I'm excited to share this improved version with the community!

🆕 What's New in Beta2?

✨ More Training Data - Expanded custom dataset with more variety
✨ Later Epoch - More refined training for improved stability and quality
✨ Multiple Formats - Now available in BF16, FP16, FP8, and GGUF (Q4/Q6/Q8/F16)
✨ Better Consistency - Improved results across all resolutions and prompts
✨ Optimized Quality - Enhanced detail and color accuracy

🔮 Coming Soon: QWEN-Anime-Beta2-AIO

I'm currently working on an All-in-One version (Beta2-AIO) that will include:

  • ✅ Integrated VAE and Text Encoder

  • ✅ No separate file downloads needed

  • ✅ Plug-and-play experience

  • ✅ Same quality, easier setup


⚡ What Makes This Special?

🎯 Custom Training Foundation

✅ Based on qwen-image-edit 2509 architecture
✅ Fine-tuned on my own custom-created anime dataset
✅ Dual Tesla P40 GPU training for stability and quality
✅ Optimized for authentic anime aesthetics
✅ Beta2: Extended training with larger dataset

🚀 Performance Highlights

⚡ Fast generation with Lightning LoRAs (about 20-52s from 512×512 up to 1024×1024 on 8GB VRAM)
📐 Multi-resolution support – From 512×512 to 2K+ resolutions
💪 Flexible step range – Quality results from 4 to 50 steps
🎨 Beta2: Improved consistency across all settings

🎨 Versatile Output

🌐 Natural language prompts and danbooru tags supported
🎛️ Multiple sampler support (Euler A, Euler Normal, Beta, Simple)
🔄 Compatible with Qwen Lightning LoRAs for speed boosts
✨ Beta2: Better prompt understanding and adherence


💬 Beta Notice & Community Testing

⚠️ This is still a BETA release! I'm actively seeking feedback from the community.

This checkpoint has been tested extensively on 8GB VRAM, but I need your help to:

✨ Test different precision versions (BF16, FP16, FP8, GGUF)
✨ Explore various prompting styles
✨ Discover optimal settings for different use cases
✨ Report any issues or unexpected behavior
✨ Beta2: Compare quality improvements vs Beta1

Your feedback will directly shape the final release and the upcoming AIO version! 🙌


📊 Performance Benchmarks & Test Results

All tests performed on 8GB VRAM with Euler A sampler

🔬 Test 1: Cyberpunk Hacker Girl

Resolution: 512×512 | Steps: 4 (with LoRA) | CFG: 1.0 | Time: 19.96s

Cyberpunk Hacker Girl Image

Prompt:

anime, masterpiece, best quality, 1girl, neon hacker, short purple hair with glowing cyan highlights, cybernetic eye augmentation, focused expression, typing on holographic keyboard, glowing UI reflections on face, dark room illuminated by multiple screens, cyberpunk 2077 aesthetic, sharp details, cinematic lighting, rain on window bokeh, DETAILED EYES, tech wear, futuristic

Use Case: Ultra-fast character generation with Lightning LoRA


🔬 Test 2: Fantasy Cat Magician

Resolution: 512×512 | Steps: 8 (with LoRA) | CFG: 1.0 | Time: 37.43s

Fantasy Cat Magician Image

Prompt:

anime, 1girl, nekomimi, fluffy cat ears, long silver hair with bells, golden feline eyes, playful smirk, witch hat with star ornaments, casting fire magic, floating spell circles, cozy library background, warm candlelight, soft shading, fluffy tail visible, DND fantasy style, vibrant colors, DETAILED FUR, sparkles around hands

Use Case: Detailed fantasy characters with kemonomimi features


🔬 Test 3: Samurai in the Rain

Resolution: 512×768 | Steps: 20 | CFG: 2.5 | Time: 166.19s

Samurai in the Rain Image

Prompt:

anime, 1boy, young samurai, black messy hair wet from rain, determined piercing eyes, scar over left eyebrow, traditional dark blue kimono, sword drawn with water droplets on blade, dramatic rainfall, neon signs blurred in background, cyber-samurai aesthetic, dark moody atmosphere, sharp focus on face, cinematic composition

Use Case: Portrait format with atmospheric effects and detailed composition


🔬 Test 4: Mystical Elf

Resolution: 512×768 | Steps: 50 | CFG: 4.0 | Time: 414.57s

Mystical Elf Image

Prompt:

anime, 1girl, elf, long white hair, glowing blue eyes, pointy ears, nature background, ANIME, masterpiece, sharp focus, ethereal glow, MOE STYLE, tree magic, fast render

Note: Higher step count for maximum detail refinement on 8GB VRAM

Use Case: Ultimate quality mode when time isn't a constraint


🔬 Test 5: Gothic Gamer Girl

Resolution: 1024×1024 | Steps: 8 (with LoRA) | CFG: 1.0 | Time: 51.85s

Gothic Gamer Girl Image

Prompt:

anime, masterpiece, best quality, ultra-detailed, gothic, anime confident gamer girl with turquoise pixie hair and dragon tattoo on neck, golden eyes and smug expression, ripped band shirt falling off shoulder revealing bare breast and underboob, jeans unzipped and low on hips, casually leaning on gaming setup with beer, messy bedroom background.

Use Case: High-resolution character art with complex details


🔬 Test 6: Llama Mode 😅👍

Resolution: 832×1216 | Steps: 8 (with LoRA) | CFG: 1.0 | Time: 52.03s

Llama Mode Image

Prompt:

anime, In the fiery depths of an erupting volcano, a solitary llama stands amidst smoldering ruin, silhouetted against molten lava and towering ash clouds. Its plush coat is immaculate despite the surroundings - jagged metal, cracked concrete, glowing embers. The llama's calm demeanor contrasts sharply with the chaos around it. Bright hues of flames and smoke frame the scene. Studio anime style emphasizes detailed backgrounds and expressions; vibrant colors enhance the dramatic setting.

Use Case: Demonstrates versatility - even unusual concepts work beautifully!


🔬 Test 7: Mecha Pilot Portrait

Resolution: 1024×1536 | Steps: 8 (with LoRA) | CFG: 1.0 | Time: 63.73s

Mecha Pilot Portrait Image

Prompt:

anime, 1girl, mecha pilot, blue jumpsuit, headset, short red hair, serious expression, cockpit background, HUD reflections on face, sharp ANIME STYLE, high contrast, detailed eyes, mechanical details, neon accents, SPEEDPAINT, high quality

Use Case: Tall format excellence for full-body character portraits


🔬 Test 8: Princess Zelda (Dual Test)

Resolution: 2048×1152 | CFG: 1.0

4-Step Version: 45.28s (with 4-step LoRA)
8-Step Version: 83.57s (with 8-step LoRA)

Princess Zelda Image

Prompt:

anime, masterpiece, 2k resolution, ultra-detailed, depth of field, best quality, (anime coloring, anime screencap:1.5), official art style, 1girl, Princess Zelda from The Legend of Zelda, standing on an ancient stone balcony overlooking Hyrule at sunrise, long golden hair flowing in the wind, wearing elegant white and gold royal robes with light blue accents, glowing runes on fabric edges, soft light reflecting from the Master Sword resting nearby, ancient ruins and floating islands in the background, golden morning mist, warm cinematic lighting, anime fantasy atmosphere, calm expression full of determination, official art aesthetics, add microdetails, masterpiece illustration, ultra-realistic anime rendering, inspired by Tears of the Kingdom and Breath of the Wild.

Use Case: Cinematic 2K rendering with both speed and quality options

Note: Even 2K resolutions work smoothly on 8GB VRAM! Speed vs quality choice with different LoRA steps.
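
To compare the runs above more directly, the timings can be converted to seconds per step. The sketch below only divides the reported wall-clock time by the step count, so model loading, text encoding, and VAE decoding are included in each figure. The labels "Test 8a" and "Test 8b" are made up here to tell the two Zelda runs apart.

```python
# (test, resolution, steps, seconds) copied from the test results above.
# "Test 8a"/"Test 8b" are hypothetical labels for the 4-step and 8-step Zelda runs.
RESULTS = [
    ("Test 1", "512x512", 4, 19.96),
    ("Test 2", "512x512", 8, 37.43),
    ("Test 3", "512x768", 20, 166.19),
    ("Test 4", "512x768", 50, 414.57),
    ("Test 5", "1024x1024", 8, 51.85),
    ("Test 6", "832x1216", 8, 52.03),
    ("Test 7", "1024x1536", 8, 63.73),
    ("Test 8a", "2048x1152", 4, 45.28),
    ("Test 8b", "2048x1152", 8, 83.57),
]


def seconds_per_step(steps: int, seconds: float) -> float:
    """Wall-clock seconds divided by sampling steps (fixed overhead included)."""
    return seconds / steps


for test, resolution, steps, seconds in RESULTS:
    rate = seconds_per_step(steps, seconds)
    print(f"{test:<8} {resolution:<10} {steps:>2} steps  {rate:5.2f} s/step")
```

Read this way, Tests 3 and 4 (CFG above 1.0) come out at roughly 8.3 s/step, against about 4.7 to 5.0 s/step for the 512×512 LoRA runs at CFG 1.0. That would be consistent with CFG above 1.0 needing an extra model pass per step, although the resolution differs as well.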


🎯 Quick Start Guide

Basic Settings (Recommended for Testing)

  • Steps: 4-8 with Lightning LoRA, 20-32 without

  • CFG Scale: 1.0-4.0 (lower with LoRA, higher for more control)

  • Sampler: Euler A, Euler Normal, Beta, or Simple

  • Resolution: Start with 512×512, scale up as needed

Recommended LoRAs for Speed

  • Qwen-Image-Lightning-4steps-V1.0.safetensors

  • Qwen-Image-Lightning-8steps-V1.0.safetensors

💡 Pro Tips for Beta Testing

  • Want lightning-fast results? → Use 4-step LoRA with CFG 1.0

  • Need more control? → Skip LoRA and use 20-32 steps with CFG 2.5-4.0

  • Stronger anime style? → Add trigger words like masterpiece, best quality, ANIME

  • Experimenting? → Try different step counts - this model scales well from 4 to 50 steps!

  • Beta2 Tip: Try different precision versions - FP8 is fastest, BF16/FP16 for max quality


🔧 Technical Information

Training Details

  • Base Model: qwen-image-edit 2509

  • Training Hardware: Dual Nvidia Tesla P40 GPUs

  • Training Data: Custom anime dataset (created by me)

  • Training Duration: Many hours of intensive training

  • Architecture: QWEN-based (requires separate VAE/CLIP)

  • Beta2 Improvements: Extended training with larger dataset, later epoch for better convergence

Available Formats

Pruned Models (Safetensors)

  • BF16 (38.05 GB) - Maximum precision

  • FP16 (38.05 GB) - Standard precision

  • FP8 (19.03 GB) - Fast & memory efficient

GGUF Models (Requires ComfyUI-GGUF custom node)

⚠️ Installation required: https://github.com/city96/ComfyUI-GGUF

  • F16 (38.07 GB) - Full precision

  • Q8 (20.23 GB) - High quality quantized

  • Q6_K (15.63 GB) - Balanced quality/size

  • Q4_K_S (10.72 GB) - Smallest, good quality
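
As a rough sanity check on the GGUF sizes above, each file can be compared with the F16 file to estimate its bits per weight. This is a back-of-the-envelope sketch: it assumes the F16 file stores 16 bits for every weight and ignores metadata and any tensors left unquantized.

```python
F16_GB = 38.07  # size of the F16 GGUF listed above

# Quantized GGUF sizes in GB, as listed above.
SIZES_GB = {"Q8": 20.23, "Q6_K": 15.63, "Q4_K_S": 10.72}


def effective_bits(size_gb: float, f16_gb: float = F16_GB) -> float:
    """Approximate bits per weight, assuming the F16 file uses 16 bits per weight."""
    return 16 * size_gb / f16_gb


for name, size in SIZES_GB.items():
    share = size / F16_GB
    print(f"{name}: ~{effective_bits(size):.1f} bits per weight, {share:.0%} of the F16 size")
```

The estimates land a little above 8, 6, and 4 bits, which is expected because block-quantized formats also store per-block scale values.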

GGUF Usage:

  • Place in models/unet/ folder (NOT diffusion_models/)

  • Use UnetLoaderGGUF node in ComfyUI

  • Great for CPU inference or limited VRAM setups

Recommendation:

  • Use FP8 for best speed/quality balance (8GB VRAM)

  • Use BF16/FP16 for maximum quality (12GB+ VRAM)

  • Use GGUF Q6/Q8 for CPU inference or very limited VRAM
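
The recommendation above can be written as a simple rule of thumb. The 8 GB and 12 GB thresholds come from this card. Treating anything below 8 GB as "very limited VRAM" is an assumption of this sketch, and `recommend_format` is a hypothetical helper name.

```python
def recommend_format(vram_gb: float) -> str:
    """Map available VRAM (GB) to the format recommended on this card."""
    if vram_gb >= 12:
        return "BF16 or FP16 safetensors"
    if vram_gb >= 8:
        return "FP8 safetensors"
    # Assumption: "very limited VRAM" means below 8 GB; the card gives no number.
    return "GGUF Q6 or Q8"


for vram in (6, 8, 16):
    print(f"{vram} GB VRAM -> {recommend_format(vram)}")
```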

What's NOT Included (Beta Version)

❌ This is NOT an all-in-one checkpoint
❌ Requires a separate VAE (qwen_image_vae.safetensors)
❌ Requires a separate CLIP/Text Encoder (qwen_2.5_vl_7b_fp8_scaled.safetensors)
❌ Lightning LoRAs are a separate download (optional but recommended)

🔮 Coming Soon: Beta2-AIO (All-in-One)

I'm currently developing QWEN-Anime-Beta2-AIO which will include:

  • ✅ Integrated VAE and Text Encoder (no separate downloads!)

  • ✅ Single file, plug-and-play experience

  • ✅ Works with default ComfyUI workflow out of the box

  • ✅ Same Beta2 quality with maximum convenience

Expected: In a future update, after gathering Beta2 feedback!

Performance Characteristics

✅ Optimized for anime aesthetics
✅ Fast convergence with Lightning LoRAs
✅ Flexible step range (4-50 steps tested)
✅ Multi-resolution support up to 2K+
✅ 8GB VRAM tested and confirmed
✅ Beta2: Improved stability and consistency


⚙️ Sampler Compatibility

Confirmed Working

  • ✅ Euler A (primary testing sampler)

  • ✅ Euler Normal (confirmed working)

  • ✅ Beta (confirmed working)

  • ✅ Simple (confirmed working)

Community Testing Welcome

  • ❓ Other QWEN-compatible samplers

All four samplers listed above worked in my tests, so feel free to use whichever you prefer. Other samplers are untested; please share your experiences with them in the comments!


📐 Resolution Support

Tested & Confirmed (8GB VRAM)

  • ✅ 512×512 – Lightning fast (20-40s with LoRA)

  • ✅ 512×768 – Portrait format (166-415s at 20-50 steps)

  • ✅ 1024×1024 – Standard HD (~52s with LoRA)

  • ✅ 832×1216 – Tall/vertical (52s with LoRA)

  • ✅ 1024×1536 – Full-body portraits (64s with LoRA)

  • ✅ 2048×1152 – 2K cinematic (45-84s with LoRA)

Higher resolutions may work but need community testing!


🎯 Ideal For

Beta Testers – Help shape the final release and AIO version!
Speed Enthusiasts – Lightning LoRA support for rapid generation
Anime Artists – Custom-trained on hand-crafted anime dataset
Character Designers – Strong performance on character details
Fantasy Creators – Great with magical and fantasy elements
Versatility Seekers – From cyberpunk to medieval fantasy
Low VRAM Users – FP8 and GGUF Q4/Q6 versions available


📝 Prompting Guide

Recommended Prompting Style

Based on successful tests, use this structure:

anime, [quality tags], [subject description], [style details], [atmosphere/lighting]
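
The structure above can be assembled with a few lines of code. This is a minimal sketch: `build_prompt` is a hypothetical helper, and the tags in the example are taken from the test prompts on this card.

```python
def build_prompt(subject, quality=(), style=(), atmosphere=()):
    """Join the parts in the order: anime, quality tags, subject, style, atmosphere."""
    parts = ["anime", *quality, subject, *style, *atmosphere]
    # Drop empty entries so optional sections can simply be left out.
    return ", ".join(part.strip() for part in parts if part.strip())


prompt = build_prompt(
    "1girl, mecha pilot, short red hair, serious expression",
    quality=["masterpiece", "best quality"],
    style=["anime coloring"],
    atmosphere=["cockpit background", "cinematic lighting"],
)
print(prompt)
```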

Quality Tags That Work Well

  • masterpiece, best quality, ultra-detailed

  • 2k resolution, depth of field

  • sharp focus, high quality

  • DETAILED EYES, DETAILED [feature]

Style Modifiers

  • anime coloring, anime screencap

  • MOE STYLE, official art style

  • cinematic lighting, dramatic composition

  • SPEEDPAINT (for painterly look)

Natural Language: Works great! The model handles complex descriptions.
Danbooru Tags: Also supported for familiar tag-based workflows.
Mixed Approach: Combine both for best results!


🔞 Content Notice

This checkpoint has limited NSFW support, similar to the base model:

✅ Supported: Artistic depictions including bare breasts and underboob
❌ Not Supported: Full explicit NSFW content beyond the base model's capabilities

The training process did not significantly modify the NSFW capabilities of the base qwen-image-edit 2509 model. For explicit content generation, other specialized models may be more suitable.

Intended for users 18+ only. Use responsibly and in accordance with CivitAI guidelines.


🔗 Model Links

Required Downloads

You can download this model and all required components here:

Diffusion Model (Main Checkpoint)

  • qwen-anime-beta1/beta2 (various formats) - Available on this CivitAI page

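LoRA (Optional but Recommended for Speed)

  • Qwen-Image-Lightning-4steps-V1.0.safetensors or Qwen-Image-Lightning-8steps-V1.0.safetensors

Text Encoder (Required)

  • qwen_2.5_vl_7b_fp8_scaled.safetensors

VAE (Required)

  • qwen_image_vae.safetensors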

📂 Model Storage Location

Place the downloaded files in the following ComfyUI directory structure:

For Safetensors Models (BF16/FP16/FP8):

📂 ComfyUI/
├── 📂 models/
│   ├── 📂 diffusion_models/
│   │   └── qwen-anime-beta2-fp8.safetensors (or your chosen version)
│   ├── 📂 loras/
│   │   ├── Qwen-Image-Lightning-4steps-V1.0.safetensors
│   │   └── Qwen-Image-Lightning-8steps-V1.0.safetensors
│   ├── 📂 vae/
│   │   └── qwen_image_vae.safetensors
│   └── 📂 text_encoders/
│       └── qwen_2.5_vl_7b_fp8_scaled.safetensors

For GGUF Models (Q4/Q6/Q8/F16):

⚠️ IMPORTANT: GGUF models require the ComfyUI-GGUF custom node!

Installation:

https://github.com/city96/ComfyUI-GGUF

GGUF File Location:

📂 ComfyUI/
├── 📂 models/
│   ├── 📂 unet/
│   │   └── qwen-anime-beta2-Q8.gguf (or your chosen GGUF version)
│   ├── 📂 loras/
│   │   ├── Qwen-Image-Lightning-4steps-V1.0.safetensors
│   │   └── Qwen-Image-Lightning-8steps-V1.0.safetensors
│   ├── 📂 vae/
│   │   └── qwen_image_vae.safetensors
│   └── 📂 text_encoders/
│       └── qwen_2.5_vl_7b_fp8_scaled.safetensors
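
To double-check an install against the two layouts above, a short script can list which required files are still missing. This is a sketch under a few assumptions: the file names are the ones shown in the trees, `missing_files` is a hypothetical helper, and "ComfyUI" in the usage line stands for wherever your ComfyUI folder actually lives. The optional LoRAs are not checked.

```python
from pathlib import Path

# Required companion files and their folders, as shown in the trees above.
REQUIRED = {
    "vae": "qwen_image_vae.safetensors",
    "text_encoders": "qwen_2.5_vl_7b_fp8_scaled.safetensors",
}


def missing_files(comfy_root, model_file):
    """Return the required files (relative to models/) that are not present yet."""
    root = Path(comfy_root) / "models"
    # GGUF files go in unet/, safetensors files go in diffusion_models/.
    model_dir = "unet" if model_file.lower().endswith(".gguf") else "diffusion_models"
    expected = [root / model_dir / model_file]
    expected += [root / folder / name for folder, name in REQUIRED.items()]
    return [p.relative_to(root).as_posix() for p in expected if not p.is_file()]


# Example: adjust the path and the model file name to your own setup.
print(missing_files("ComfyUI", "qwen-anime-beta2-fp8.safetensors"))
```

An empty list means the model, VAE, and text encoder are all where this card expects them.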

💡 GGUF Notes:

  • GGUF models go in the unet/ folder, NOT diffusion_models/

  • You MUST install the ComfyUI-GGUF custom node from city96

  • Use the UnetLoaderGGUF node instead of the regular "Load Diffusion Model" node

  • GGUF models are great for CPU inference or very limited VRAM

💡 General Note: You only need ONE of the Lightning LoRAs (4-step OR 8-step), not both. Choose based on your preference:

  • 4-step LoRA = Faster generation, slightly less detail

  • 8-step LoRA = Better balance of speed and quality


🚀 Getting Started

Installation

For Safetensors Models (BF16/FP16/FP8):

  1. Download your preferred version (FP8 recommended for 8GB VRAM)

  2. Download the required Text Encoder and VAE from the links above

  3. Optional: Download your preferred Lightning LoRA (4-step or 8-step)

  4. Place all files in the correct ComfyUI folders (see directory structure above)

  5. Load the model with the "Load Diffusion Model" node in ComfyUI and generate!

For GGUF Models (Q4/Q6/Q8/F16):

  1. Install ComfyUI-GGUF custom node first: https://github.com/city96/ComfyUI-GGUF

  2. Download your preferred GGUF version (Q8 recommended for quality)

  3. Download the required Text Encoder and VAE from the links above

  4. Optional: Download your preferred Lightning LoRA (4-step or 8-step)

  5. Place GGUF file in models/unet/ folder (NOT diffusion_models/)

  6. Use UnetLoaderGGUF node instead of regular UNETLoader in your workflow

  7. Generate!

First Generation Recommendation

  • Resolution: 512×512

  • Steps: 8 (with 8-step LoRA) or 20 (without LoRA)

  • CFG: 1.0 (with LoRA) or 3.0 (without)

  • Sampler: Euler A (or any of the confirmed working samplers)

  • Use a simple prompt from the test examples
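
The first-run settings above can be captured in a small helper for note-taking or scripting. This is only a sketch: `first_run_settings` is a hypothetical name, "Euler A" is the label used on this card rather than a ComfyUI identifier, and the 4-step case (4 steps, CFG 1.0) is taken from the Pro Tips rather than from this list.

```python
from typing import Optional


def first_run_settings(lora_steps: Optional[int] = None) -> dict:
    """Return the suggested first-generation settings, with or without a Lightning LoRA."""
    settings = {"width": 512, "height": 512, "sampler": "Euler A"}
    if lora_steps is None:
        # No Lightning LoRA: more steps and a higher CFG.
        settings.update(steps=20, cfg=3.0)
    elif lora_steps in (4, 8):
        # Lightning LoRA: step count matches the LoRA, CFG stays at 1.0.
        settings.update(steps=lora_steps, cfg=1.0)
    else:
        raise ValueError("the Lightning LoRAs on this card are 4-step or 8-step")
    return settings


print(first_run_settings(8))
print(first_run_settings())
```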


🐛 Known Issues & Beta Limitations

Current Status

  • ⚠️ Limited hardware variety testing (mainly 8GB VRAM)

  • ⚠️ Still gathering feedback on optimal settings

  • ⚠️ Some prompting patterns may need refinement

  • ⚠️ Not yet available as all-in-one checkpoint

  • ⚠️ Beta2: Need feedback on different precision versions

Help Wanted

  • 🔍 Test on different hardware (12GB, 16GB, 24GB VRAM)

  • 🔍 Compare different precision versions (BF16 vs FP16 vs FP8)

  • 🔍 Test GGUF versions with various tools

  • 🔍 Explore edge cases and unusual prompts

  • 🔍 Compare Beta2 improvements vs Beta1

  • 🔍 Share optimal settings you discover


❓ FAQ

Q: Which version should I download?
A: For 8GB VRAM, use FP8. For 12GB+ VRAM, use BF16 or FP16 for maximum quality.

Q: What's the difference between Beta1 and Beta2?
A: Beta2 has more training data, later training epoch, better consistency, and is available in multiple precision formats.

Q: Is this a complete all-in-one checkpoint?
A: Not yet! This beta requires separate VAE and CLIP setup. Beta2-AIO is coming soon with everything integrated!

Q: Do I need the Lightning LoRAs?
A: No, but they're highly recommended for speed! Without them, use 20-32 steps instead.

Q: Why is this still a beta?
A: I want community feedback before finalizing. Your testing helps improve the final release and the upcoming AIO version!

Q: What samplers work best?
A: Euler A, Euler Normal, Beta, and Simple all work great! Use your favorite.

Q: Can I use my own LoRAs on top?
A: Absolutely! This is a full diffusion model, so you can layer LoRAs on top.

Q: Will there be a final release?
A: Yes! Based on community feedback, I'll refine and release a stable version with all-in-one packaging (Beta2-AIO).

Q: What hardware do I need?
A: 8GB VRAM works with FP8. Higher precision versions need more VRAM. GGUF versions can run on CPU.

Q: Does it work with SDXL workflows?
A: No. It uses the Qwen-Image architecture and needs a Qwen-Image workflow.

Q: How much NSFW content can it generate?
A: Limited to what the base model supports - primarily artistic nudity like breasts. Not designed for explicit content.

Q: What's the difference between GGUF Q4/Q6/Q8?
A: Q4 is the smallest (about 11 GB), Q6 is balanced (about 16 GB), and Q8 is the highest quality (about 20 GB). Higher number = better quality but larger size.

Q: How do I use GGUF models?
A: Install the ComfyUI-GGUF custom node from https://github.com/city96/ComfyUI-GGUF, place the GGUF file in the models/unet/ folder, and use the UnetLoaderGGUF node instead of the regular "Load Diffusion Model" node.


📸 Showcase Your Creations!

When sharing your work, please include:

  • Your prompt

  • Resolution and settings (steps, CFG, sampler)

  • Which version you used (FP8, BF16, GGUF, etc.)

  • Whether you used Lightning LoRAs

  • Your VRAM amount

  • Any issues or surprises you encountered

  • Beta2: Comparison with Beta1 if possible

This helps everyone learn optimal settings for this beta!


💬 Feedback & Community

This is a BETA - I need YOUR input!

What I'm Looking For:

  • ✅ Quality feedback on different prompting styles

  • ✅ Performance reports on various hardware

  • ✅ Comparison with other anime models

  • ✅ Bug reports and unexpected behavior

  • ✅ Suggestions for improvement

  • ✅ Optimal settings you've discovered

  • ✅ Beta2: Feedback on different precision versions

  • ✅ Beta2: Quality comparison vs Beta1

How to Help:

  • 📝 Leave detailed reviews with your test results

  • 🖼️ Share your generated images with settings

  • 💬 Comment with feedback and suggestions

  • 🐛 Report issues with reproduction steps

  • ⭐ Rate honestly based on your experience


🙏 Credits & Thanks

Built On:

  • qwen-image-edit 2509 - Foundation model

  • Custom Training Dataset - Hand-created anime collection by me

  • Nvidia Tesla P40 (×2) - Training hardware

  • Qwen Lightning LoRAs - Speed optimization (separate download)

Special Thanks:

  • The QWEN development team for the excellent base model

  • The CivitAI community for ongoing support

  • Everyone who tests this beta and provides feedback!

  • Beta1 testers - Your feedback made Beta2 possible!


📊 Beta Status Summary

✅ Working Features:

  • Core Functionality - Fully operational

  • Lightning LoRA Compatible - Tested and confirmed

  • Multi-Resolution Support - 512px to 2K+

  • 8GB VRAM Support - Confirmed with FP8

  • 12GB+ VRAM Support - Confirmed with BF16/FP16

  • GGUF Support - Available in Q4/Q6/Q8/F16

  • Samplers: Euler A, Euler Normal, Beta, Simple - All confirmed

  • Style Versatility - Anime-focused with great results

⚠️ Limitations:

  • NSFW Support - Limited to artistic nudity (breasts/underboob)

  • Full Explicit NSFW - Not trained for this

  • All-in-One Package - Not yet (Beta2-AIO in development)

  • Production Status - Still in Beta stage

🔄 In Progress:

  • Community feedback gathering

  • Hardware compatibility testing

  • Optimal settings discovery


🚀 Roadmap

Beta2 Phase (Current)

  • ✅ Beta2 release with improved training

  • ✅ Multiple precision formats (BF16/FP16/FP8)

  • ✅ GGUF versions available

  • 🔄 Gathering community feedback

  • 🔄 Hardware compatibility testing

  • 🔄 Optimal settings discovery

Planned for Beta2-AIO

  • ⏳ Currently in development!

  • 📋 All-in-One checkpoint with integrated VAE/CLIP

  • 📋 Single file download

  • 📋 Plug-and-play experience

  • 📋 Works with default ComfyUI workflow

  • 📋 No separate file management needed

Planned for Stable Release

  • 📋 Incorporate community feedback

  • 📋 Additional training refinements

  • 📋 Larger dataset with more training

  • 📋 Expanded documentation

  • 📋 Advanced prompting guide

  • 📋 Final production-ready version


Download QWEN-Anime-Beta2 now and help shape the final release! 🎨

Your feedback is invaluable - every test, comment, and suggestion makes this better for everyone!


Version: Beta 2
Release Date: November 2025
Training: Custom self-created dataset on dual Tesla P40 GPUs (Extended)
Base: qwen-image-edit 2509
Coming Soon: Beta2-AIO (All-in-One version in development)

Created with ❤️ for the AI art community

⚠️ Remember: This is BETA software. Expect rough edges and please report them! Together we can make this amazing! 🚀


🎯 Version Comparison

📦 Beta1

  • Training Data: Base Dataset

  • Training Epoch: Early stage

  • Available Formats: FP16 only

  • Consistency: Good quality

  • Setup: Requires 4 separate files

  • Quality: High

📦 Beta2 (Current)

  • Training Data: Extended Dataset ✨

  • Training Epoch: Later stage (more refined) ✨

  • Available Formats: BF16, FP16, FP8, GGUF (Q4/Q6/Q8/F16) ✨

  • Consistency: Better quality and stability ✨

  • Setup: Requires 4 separate files

  • Quality: Higher than Beta1 ✨

📦 Beta2-AIO (Coming Soon!)

  • Training Data: Extended Dataset (same as Beta2)

  • Training Epoch: Later stage (same as Beta2)

  • Consistency: Same as Beta2

  • Setup: Single file - plug and play! 🎉

  • File Count: Just 1 file (VAE + CLIP integrated)

  • Quality: Same high quality as Beta2

Recommendation:

  • Choose Beta2 for best current quality and flexibility

  • Wait for Beta2-AIO if you want the easiest setup experience!
