Stars
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…
🚀🚀 "Large model": train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc. 🎉🎉
Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
Awesome-LLM: a curated list of Large Language Models
Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm
🔥Highlighting the top ML papers every week.
Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord community: https://discord.com/invite/TgHXuSJEk6
📕 Machine learning tech collections at Microsoft and its subsidiaries.
A demo project showing how to use Intel Analytics Zoo to train an NCF deep learning model and run inference with it
Notebook to train an AI model to detect diseases in chest X-rays
BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
A living collection of deep learning problems
Step-by-step Deep Learning Tutorials on Apache Spark using BigDL