Feature request: Lumina2 architrcture support #581

stduhpf · 2025-01-30T21:22:35Z

Lumina-Image-2.0 is a 2B flow-based diT. Image quality seems pretty good at first glance, and it's able to generate some text.
https://huggingface.co/Alpha-VLLM/Lumina-Image-2.0

The text "encoder" used is gemma2-2B (autoregressive LLM), like Nvidia Sana.
It uses a 16 channel VAE like Flux and SD3, so this would make it easier to implement than sana.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: Lumina2 architrcture support #581

Feature request: Lumina2 architrcture support #581

stduhpf commented Jan 30, 2025 •

edited

Loading

Feature request: Lumina2 architrcture support #581

Feature request: Lumina2 architrcture support #581

Comments

stduhpf commented Jan 30, 2025 • edited Loading

stduhpf commented Jan 30, 2025 •

edited

Loading