Feature request: Lumina2 architecture support #581

Open
stduhpf opened this issue Jan 30, 2025 · 0 comments

stduhpf commented Jan 30, 2025

Lumina-Image-2.0 is a 2B-parameter flow-based DiT (diffusion transformer). Image quality seems pretty good at first glance, and it can generate some text in images.
https://huggingface.co/Alpha-VLLM/Lumina-Image-2.0

The text "encoder" is Gemma 2 2B (an autoregressive LLM), as in Nvidia Sana.
It uses a 16-channel VAE like Flux and SD3, so it should be easier to implement than Sana.
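
For a rough sense of what the shared VAE layout means for an implementation, here is a minimal Python sketch of the latent shape a 16-channel VAE would produce. The 8x spatial downscale factor and the `latent_shape` helper are assumptions for illustration; neither comes from the Lumina code or from this project.

```python
def latent_shape(width, height, channels=16, downscale=8):
    """Return (channels, latent_height, latent_width) for an image size.

    channels=16 matches the VAE described above; downscale=8 is an
    assumed spatial compression factor, not confirmed by the issue.
    """
    if width % downscale or height % downscale:
        raise ValueError("image size must be a multiple of the downscale factor")
    return (channels, height // downscale, width // downscale)


# Latent shape for a 1024x1024 image under the assumptions above.
print(latent_shape(1024, 1024))
```

If the latent layout really does match Flux and SD3, existing VAE encode/decode paths could in principle be reused, leaving the DiT and the Gemma text encoder as the new parts.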
