清华大模型团队为您呈现最高效的大模型学习和解决方案
DeepSeeK 开源周
 【Day1】 FlashMLA - let's geek out
【Day1】 FlashMLA - let's geek out 【Day2】 DeepEP - 第一个用于 MoE 模型训练和推理的开源 EP 通信库
【Day2】 DeepEP - 第一个用于 MoE 模型训练和推理的开源 EP 通信库 【Day3】 DeepGEMM - 大道至简的通用矩阵运算
【Day3】 DeepGEMM - 大道至简的通用矩阵运算 【Day4】并行策略优化 - 将并行进行到底
【Day4】并行策略优化 - 将并行进行到底 【Day5】 Fire-Flyer 文件系统 - 让数据处理坐上高铁
【Day5】 Fire-Flyer 文件系统 - 让数据处理坐上高铁 【Day6】 DeepSeek 如何做到利润率 545%
【Day6】 DeepSeek 如何做到利润率 545%DeepSeek 深度教程
 DeepSeek从入门到精通
DeepSeek从入门到精通 DeepSeek指导手册
DeepSeek指导手册 DeepSeek-R1:通过强化学习激励LLMs的推理能力
DeepSeek-R1:通过强化学习激励LLMs的推理能力 DeepSeekV3技术报告
DeepSeekV3技术报告 DeepSeek_VL2技术报告
DeepSeek_VL2技术报告DeepSeek 深度教程
 DeepSeek从入门到精通
DeepSeek从入门到精通 DeepSeek本地部署
DeepSeek本地部署 DeepSeek实战技巧
DeepSeek实战技巧 刘知远团队大模型公开课
刘知远团队大模型公开课 李宏毅机器学习系列课程
李宏毅机器学习系列课程 李沐大神《动手学深度学习》
李沐大神《动手学深度学习》 Prompt-Engineering-Guide
Prompt-Engineering-Guide openai-cookbook
openai-cookbook anthropic-cookbook
anthropic-cookbook generative-ai-for-beginners
generative-ai-for-beginners promptflow
promptflow Awesome-Prompt-Engineering
Awesome-Prompt-Engineering LangGPT
LangGPT SuperPrompt
SuperPrompt promptfoo
promptfoo Learning-Prompt
Learning-Prompt code2prompt
code2prompt tree-of-thoughts
tree-of-thoughts Learn_Prompting
Learn_Prompting经典书籍
 大规模语言模型:从理论到实践
大规模语言模型:从理论到实践 大语言模型
大语言模型 动手做AI Agent
动手做AI Agent Generative AI Handbook: A Roadmap for Learning Resources
Generative AI Handbook: A Roadmap for Learning Resources Understanding Deep Learning
Understanding Deep Learning Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software
Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software 自然语言处理:大模型理论与实践
自然语言处理:大模型理论与实践 Hugging Face Course
Hugging Face Course Google Machine Learning Crash Course
Google Machine Learning Crash Course Illustrated book to learn about Transformers & LLMs
Illustrated book to learn about Transformers & LLMs Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG
Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG 面向开发者的LLM入门教程
面向开发者的LLM入门教程 Foundations of Large Language Models
Foundations of Large Language Models 动手学深度学习
动手学深度学习 动手学大模型Dive into LLMs
动手学大模型Dive into LLMs Build a Large Language Model (From Scratch)
Build a Large Language Model (From Scratch) 多模态大模型
多模态大模型 大型语言模型实战指南:应用实践与场景落地
大型语言模型实战指南:应用实践与场景落地 Hands-On Large Language Models
Hands-On Large Language Models 动手学强化学习
动手学强化学习 大模型基础
大模型基础 CS324: Large Language Models
CS324: Large Language Models CS229: Machine Learning
CS229: Machine Learning CS230: Deep Learning
CS230: Deep Learning CS231n: CNN for Visual Recognition
CS231n: CNN for Visual Recognition CS224n: NLP with Deep Learning
CS224n: NLP with Deep Learning CS224w: Machine Learning with Graphs
CS224w: Machine Learning with Graphs CS224u: Natural Language Understanding
CS224u: Natural Language Understanding CS234: Reinforcement Learning
CS234: Reinforcement Learning CS330: Deep Multi-task Learning
CS330: Deep Multi-task Learning CS25: Transformers United
CS25: Transformers United Stanford ML Explainability
Stanford ML Explainability Stanford NLP
Stanford NLP CMU CS 11-711: Advanced NLP
CMU CS 11-711: Advanced NLP CMU CS 11-747: Neural Networks for NLP
CMU CS 11-747: Neural Networks for NLP CMU CS 11-737: Multilingual NLP
CMU CS 11-737: Multilingual NLP CMU CS 11-785: Deep Learning
CMU CS 11-785: Deep Learning CMU CS 11-777: Multimodal ML
CMU CS 11-777: Multimodal ML CMU CS 10-708: Probabilistic Graphical Models
CMU CS 10-708: Probabilistic Graphical Models CMU LTI Low Resource NLP
CMU LTI Low Resource NLP MIT OpenCourseWare
MIT OpenCourseWare MIT 6.034: Artificial Intelligence
MIT 6.034: Artificial Intelligence MIT 6.S094: Deep Learning
MIT 6.S094: Deep Learning MIT 6.S191: Introduction to Deep Learning
MIT 6.S191: Introduction to Deep Learning MIT 6.S192: Deep Learning for Art
MIT 6.S192: Deep Learning for Art CS221: Artificial Intelligence
CS221: Artificial Intelligence MIT 6.5940: TinyML
MIT 6.5940: TinyML DeepSeek-R1
DeepSeek-R1 DeepSeek-V3
DeepSeek-V3 DeepSeek-VL2
DeepSeek-VL2 Attention Is All You Need
Attention Is All You Need BERT
BERT GPT-3
GPT-3 PaLM
PaLM InstructGPT
InstructGPT Constitutional AI
Constitutional AI LLaMA
LLaMA GPT-4
GPT-4 PaLM 2
PaLM 2 RWKV
RWKV Llama 2
Llama 2 Code Llama
Code Llama Mistral 7B
Mistral 7B Phi-2
Phi-2 Mixtral 8x7B
Mixtral 8x7B Stable LM 3B
Stable LM 3B arXiv LLM Papers
arXiv LLM Papers The First Law of Complexodynamics
The First Law of Complexodynamics Recurrent Neural Network Regularization
Recurrent Neural Network Regularization Keeping Neural Networks Simple
Keeping Neural Networks Simple Pointer Networks
Pointer Networks Order Matters: Sequence to Sequence for Sets
Order Matters: Sequence to Sequence for Sets GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism
GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition Multi-Scale Context Aggregation by Dilated Convolutions
Multi-Scale Context Aggregation by Dilated Convolutions Neural Message Passing for Quantum Chemistry
Neural Message Passing for Quantum Chemistry Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate Identity Mappings in Deep Residual Networks
Identity Mappings in Deep Residual Networks A Simple Neural Network Module for Relational Reasoning
A Simple Neural Network Module for Relational Reasoning Variational Lossy Autoencoder
Variational Lossy Autoencoder Relational Recurrent Neural Networks
Relational Recurrent Neural Networks Neural Turing Machines
Neural Turing Machines Deep Speech 2
Deep Speech 2 Scaling Laws for Neural Language Models
Scaling Laws for Neural Language Models A Tutorial on the MDL Principle
A Tutorial on the MDL Principle Machine Super Intelligence
Machine Super Intelligence Kolmogorov Complexity and Algorithmic Randomness
Kolmogorov Complexity and Algorithmic Randomness Stanford's CS231n CNN for Visual Recognition
Stanford's CS231n CNN for Visual Recognition Quantifying Complexity in Closed Systems
Quantifying Complexity in Closed Systems Gemini
Gemini Claude 3
Claude 3 Papers with Code LLM
Papers with Code LLM The Unreasonable Effectiveness of RNNs
The Unreasonable Effectiveness of RNNs Understanding LSTM Networks
Understanding LSTM Networks LLaMA-Factory
LLaMA-Factory 360-LLaMA-Factory
360-LLaMA-Factory unsloth
unsloth TRL
TRL Firefly
Firefly Xtuner
Xtuner torchtune
torchtune Swift
Swift AutoTrain
AutoTrain OpenRLHF
OpenRLHF Ludwig
Ludwig mistral-finetune
mistral-finetune aikit
aikit H2O-LLMStudio
H2O-LLMStudio LitGPT
LitGPT LLMBox
LLMBox PaddleNLP
PaddleNLP workbench-llamafactory
workbench-llamafactory TinyLLaVA Factory
TinyLLaVA Factory LLM-Foundry
LLM-Foundry lmms-finetune
lmms-finetune Simplifine
Simplifine Transformer Lab
Transformer Lab Liger-Kernel
Liger-Kernel ChatLearn
ChatLearn nanotron
nanotron Proxy Tuning
Proxy Tuning Effective LLM Alignment
Effective LLM Alignment Autotrain-advanced
Autotrain-advanced Meta Lingua
Meta Lingua Vision-LLM Alignemnt
Vision-LLM Alignemnt finetune-Qwen2-VL
finetune-Qwen2-VL Online-RLHF
Online-RLHF InternEvo
InternEvo veRL
veRL Oumi
Oumi Kiln
Kiln LM Studio
LM Studio LLM Pricing
LLM Pricing NVIDIA ChatRTX
NVIDIA ChatRTX ollama
ollama Open WebUI
Open WebUI Text Generation WebUI
Text Generation WebUI Xinference
Xinference LangChain
LangChain LlamaIndex
LlamaIndex lobe-chat
lobe-chat TensorRT-LLM
TensorRT-LLM vllm
vllm LlamaChat
LlamaChat chat-with-mlx
chat-with-mlx Open Interpreter
Open Interpreter Chat-ollama
Chat-ollama chat-ui
chat-ui MemGPT
MemGPT koboldcpp
koboldcpp LLMFarm
LLMFarm enchanted
enchanted Flowise
Flowise Jan
Jan LMDeploy
LMDeploy RouteLLM
RouteLLM MInference
MInference Mem0
Mem0 SGLang
SGLang AirLLM
AirLLM LLMHub
LLMHub YuanChat
YuanChat LiteLLM
LiteLLM GuideLLM
GuideLLM LLM-Engines
LLM-Engines OARC
OARC g1
g1 MemoryScope
MemoryScope OpenLLM
OpenLLM Infinity
Infinity optillm
optillm LLaMA Box
LLaMA Box ZhiLight
ZhiLight DashInfer
DashInfer LocalAI
LocalAI ktransformers
ktransformers LangChain
LangChain GPT4All
GPT4All Unstructured.io
Unstructured.io LlamaIndex
LlamaIndex dify
dify langfuse
langfuse Auto-GPT
Auto-GPT PrivateGPT
PrivateGPT LangChain Text Splitters
LangChain Text Splitters Unstructured.io
Unstructured.io LlamaIndex
LlamaIndex TextCortex
TextCortex Label Studio
Label Studio Texthero
Texthero Snorkel
Snorkel Prodigy
Prodigy DataTorch
DataTorch Tabula
Tabula Adobe PDF Services API
Adobe PDF Services API Great Expectations
Great Expectations Kedro
Kedro Weights & Biases
Weights & Biases Cleanlab
Cleanlab DeepSpeed
DeepSpeed Doccano
Doccano Rubrix
Rubrix Argilla
Argilla DataPrep.ai
DataPrep.ai Haystack
Haystack Datasets CLI
Datasets CLI PDFPlumber
PDFPlumber Nougat
Nougat Grobid
Grobid PdfMiner.six
PdfMiner.six OCRmyPDF
OCRmyPDF Camelot
Camelot DocTR
DocTR PaddleOCR
PaddleOCRDeepSeek 深度教程
 AnythingLLM
AnythingLLM MaxKB
MaxKB RAGFlow
RAGFlow Dify
Dify FastGPT
FastGPT Langchain-Chatchat
Langchain-Chatchat QAnything
QAnything Quivr
Quivr RAG-GPT
RAG-GPT Verba
Verba FlashRAG
FlashRAG GraphRAG
GraphRAG LightRAG (SylphAI-Inc)
LightRAG (SylphAI-Inc) GraphRAG-Ollama-UI
GraphRAG-Ollama-UI nano-GraphRAG
nano-GraphRAG RAG Techniques
RAG Techniques ragas
ragas kotaemon
kotaemon RAGapp
RAGapp TurboRAG
TurboRAG LightRAG (HKUDS)
LightRAG (HKUDS) TEN
TEN AutoRAG
AutoRAG KAG (OpenSPG - knowledge-enhanced)
KAG (OpenSPG - knowledge-enhanced) Fast-GraphRAG
Fast-GraphRAG Tiny-GraphRAG
Tiny-GraphRAG DB-GPT GraphRAG
DB-GPT GraphRAG Chonkie
Chonkie RAGLite
RAGLite KAG (OpenSPG - logical form)
KAG (OpenSPG - logical form) CAG
CAG MiniRAG
MiniRAG XRAG
XRAG完整的 AI 定制解决方案
为您呈现系统性精选的 AI 开源课程和 AI 开源工具一站式搜索,节省您在碎片化信息里的时间消耗
基于您的学习进度为您量身定制专属学习计划,最大程度提升学习效率
Mentor Copilot 随时进行专业知识的答疑解惑,为您提供最专注的学习体验
从大模型理论创新者的视角深度剖析最前沿的 AI 技术。为您提供专业的咨询参考
从场景定制,模型定制,数据处理,模型训练,生产环境推理服务搭建,为您解决最真实的场景需求
团队成员来自清华大学专业大模型团队和一线互联网资深 AI 工程师,为您提供最专业的 AI 咨询服务
清华大模型团队为您呈现最好的开源解决方案和开源课程。
及时了解我们工具的一切最新信息