Skip to content
View Ephemeral182's full-sized avatar
🤪
🤪

Block or report Ephemeral182

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Get started with native image generation and editing using Gemini 2.0 and Next.js

TypeScript 353 52 Updated Mar 17, 2025

Multimodal Models in Real World

Jupyter Notebook 449 20 Updated Feb 24, 2025

Standing on the Giants: Informative Messenger Prompts with Self-adapter for Image Restoration

1 Updated Mar 17, 2025

Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"

Jupyter Notebook 149 5 Updated Mar 18, 2025

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,340 59 Updated Mar 19, 2025

Official implementation of Unified Reward Model for Multimodal Understanding and Generation.

Python 203 3 Updated Mar 19, 2025

[CVPR 2025] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key

Python 39 1 Updated Mar 7, 2025

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,449 94 Updated Mar 19, 2025

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 182 4 Updated Feb 17, 2025

The Next Step Forward in Multimodal LLM Alignment

Python 132 3 Updated Mar 5, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,792 329 Updated Feb 20, 2025

The official code of "Weak-to-Strong Diffusion with Reflection".

Python 35 Updated Feb 11, 2025

Investigating CoT Reasoning in Autoregressive Image Generation

Python 553 20 Updated Mar 19, 2025

[AAAI‘ 2025 ] "AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement".

Python 10 1 Updated Mar 9, 2025

Evaluating text-to-image/video/3D models with VQAScore

Python 266 18 Updated Mar 16, 2025

Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

Python 185 6 Updated Dec 17, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,667 774 Updated Aug 12, 2024

The code of our work "Golden Noise for Diffusion Models: A Learning Framework".

Python 144 9 Updated Feb 17, 2025

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 417 11 Updated Mar 15, 2025

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,016 44 Updated Feb 23, 2025

CAR: Controllable AutoRegressive Modeling for Visual Generation

Python 106 3 Updated Nov 29, 2024

Illumination Drawing Tools for Text-to-Image Diffusion Models

660 83 Updated Dec 22, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 845 32 Updated Feb 19, 2025

Liquid: Language Models are Scalable and Unified Multi-modal Generators

Python 257 23 Updated Mar 14, 2025

[NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".

Python 47 5 Updated Mar 8, 2025

[ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"

Python 67 1 Updated Dec 27, 2024

The code and models for the paper: Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Jupyter Notebook 167 13 Updated Mar 19, 2025

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 422 22 Updated Oct 16, 2024

Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"

Jupyter Notebook 63 4 Updated Mar 7, 2025
Next