Skip to content
View phillipinseoul's full-sized avatar

Highlights

  • Pro

Block or report phillipinseoul

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] VGGT: Visual Geometry Grounded Transformer

Python 2,398 115 Updated Mar 19, 2025

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Python 23 Updated Mar 19, 2025

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 246 14 Updated Feb 24, 2025

Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"

Jupyter Notebook 153 6 Updated Mar 18, 2025

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 679 31 Updated Mar 20, 2025

Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"

Python 20 Updated Mar 20, 2025
Python 26 1 Updated Jan 9, 2025

[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

564 11 Updated Mar 20, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,238 263 Updated Mar 20, 2025

Build your own visual reasoning model

Python 298 15 Updated Mar 20, 2025

Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"

30 Updated Mar 13, 2025

Official implementation of Inductive Moment Matching

Python 397 6 Updated Mar 12, 2025

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 2,547 489 Updated Apr 15, 2024

😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.

139 4 Updated Mar 19, 2025

Simple and readable code for training and sampling from diffusion models

Python 446 31 Updated Jan 9, 2025

Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronization. (ICLR 2025)

Python 10 2 Updated Mar 3, 2025

SPHERE - a hierarchical evaluation for spatial reasoning in vision-language models.

Python 3 Updated Mar 10, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 8,754 935 Updated Mar 20, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 970 50 Updated Feb 25, 2025

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python 40 2 Updated Dec 10, 2024

The Superposition of Diffusion Models Using the Itô Density Estimator

Python 33 2 Updated Feb 18, 2025

침착한 생성모델 학습기

905 67 Updated Feb 22, 2021

Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.

Python 87 4 Updated Mar 20, 2025

Official PyTorch Implementation of "History-Guided Video Diffusion"

Python 229 9 Updated Mar 6, 2025

Official implementation for Rare-to-Frequent (R2F), ICLR'25, Spotlight

Python 37 Updated Mar 5, 2025

A Vision-Language Model for Spatial Affordance Prediction in Robotics

Python 135 10 Updated Mar 5, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 7,492 589 Updated Mar 20, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 14,779 1,509 Updated Mar 14, 2025

LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean.

Lean 87 6 Updated May 31, 2024

A fork to add multimodal model training to open-r1

Python 1,093 57 Updated Feb 8, 2025
Next