Skip to content
View jason-dai's full-sized avatar

Block or report jason-dai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 7,675 1,338 Updated Apr 1, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 18,004 2,013 Updated Apr 1, 2025

List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.

78 4 Updated Jun 2, 2024

📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc. 🎉🎉

Python 3,763 264 Updated Mar 31, 2025

Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 27 5 Updated Jan 9, 2025

Awesome-LLM: a curated list of Large Language Model

22,496 1,860 Updated Mar 26, 2025

Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm

Jupyter Notebook 164 41 Updated Jul 24, 2024

🔥Highlighting the top ML papers every week.

11,044 673 Updated Mar 13, 2025

NOIP, NOI, IOI

Rich Text Format 432 211 Updated Oct 27, 2024

Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty: https://discord.com/invite/TgHXuSJEk6

Jupyter Notebook 556 35 Updated Dec 4, 2023

GCN implementation on top of Apache Spark

Scala 16 3 Updated Oct 30, 2022

常用英语词汇表

1,944 466 Updated May 11, 2024

Simplify your onnx model

C++ 4,030 394 Updated Sep 3, 2024

📕machine learning tech collections at Microsoft and subsidiaries.

429 70 Updated Jan 30, 2023

信息学竞赛,国内官方网站为:

C++ 62 26 Updated Nov 28, 2018

A demo project elaborate how to use intel analytic zoo to train and inference a NCF deep learning model

Python 6 4 Updated Nov 22, 2022

Notebook to train an AI model to detect diseases in Chest Xrays

Jupyter Notebook 8 5 Updated Apr 23, 2019

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,674 731 Updated Mar 27, 2025
Python 7 5 Updated Jun 20, 2022

A living collection of deep learning problems

HTML 1,712 602 Updated May 3, 2024

Step-by-step Deep Leaning Tutorials on Apache Spark using BigDL

Jupyter Notebook 210 124 Updated Jan 3, 2023

Google hosts generator

Shell 3,323 1,243 Updated Mar 25, 2023