Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronization. (ICLR 2025)

Python · 10 stars · 2 forks · Updated Mar 3, 2025

zwenyu / SPHERE-VLM

SPHERE - a hierarchical evaluation for spatial reasoning in vision-language models.

Python · 3 stars · Updated Mar 10, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python · 8,754 stars · 935 forks · Updated Mar 20, 2025

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python · 970 stars · 50 forks · Updated Feb 25, 2025

zjunlp / Deco

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python · 40 stars · 2 forks · Updated Dec 10, 2024

necludov / super-diffusion

The Superposition of Diffusion Models Using the Itô Density Estimator

Python · 33 stars · 2 forks · Updated Feb 18, 2025

bryandlee / malnyun_faces

A calm generative model training log

905 stars · 67 forks · Updated Feb 22, 2021

EmbodiedBench / EmbodiedBench

Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.

Python · 87 stars · 4 forks · Updated Mar 20, 2025

kwsong0113 / diffusion-forcing-transformer

Official PyTorch Implementation of "History-Guided Video Diffusion"

Python · 229 stars · 9 forks · Updated Mar 6, 2025

krafton-ai / Rare-to-Frequent

Official implementation for Rare-to-Frequent (R2F), ICLR'25, Spotlight

Python · 37 stars · Updated Mar 5, 2025

wentaoyuan / RoboPoint

A Vision-Language Model for Spatial Affordance Prediction in Robotics

Python · 135 stars · 10 forks · Updated Mar 5, 2025

Tencent / Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python · 7,492 stars · 589 forks · Updated Mar 20, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript · 14,779 stars · 1,509 forks · Updated Mar 14, 2025

loganrjmurphy / LeanEuclid

LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean.

Lean · 87 stars · 6 forks · Updated May 31, 2024

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python · 1,093 stars · 57 forks · Updated Feb 8, 2025

Yuseung (Phillip) Lee phillipinseoul

Stars

facebookresearch / vggt

thunlp / DeepPerception

LeslieTrue / SFTvsRL

rongyaofang / GoT

Stability-AI / stable-virtual-camera

byeongjun-park / SteerX

zhouyiks / CoLVA

KwaiVGI / ReCamMaster

om-ai-lab / VLM-R1

groundlight / r1_vlm

HyeonHo99 / Reangle-Video

lumalabs / imm

deepseek-ai / DeepSeek-Math

liudaizong / Awesome-3D-Visual-Grounding

yuanchenyang / smalldiffusion

KAIST-Visual-AI-Group / StochSync