Quoted By:
/lmg/ - a general dedicated to the discussion and development of local language models.
Previous threads: >>103747569 & >>103735845
►News
>(12/26) CogAgent-9B updated version released: https://hf.co/THUDM/cogagent-9b-20241220
>(12/26) DeepSeek-V3 instruct released: https://hf.co/deepseek-ai/DeepSeek-V3
>(12/25) DeepSeek-V3-Base 671B-A37B released: https://hf.co/deepseek-ai/DeepSeek-V3-Base
>(12/24) QVQ: 72B visual reasoning model released: https://qwenlm.github.io/blog/qvq-72b-preview
>(12/24) Infinity 2B, bitwise autoregressive text-to-image model: https://hf.co/FoundationVision/Infinity
►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png
►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant
►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers
►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/hsiehjackson/RULER
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
Previous threads: >>103747569 & >>103735845
►News
>(12/26) CogAgent-9B updated version released: https://hf.co/THUDM/cogagent-9b-20241220
>(12/26) DeepSeek-V3 instruct released: https://hf.co/deepseek-ai/DeepSeek-V3
>(12/25) DeepSeek-V3-Base 671B-A37B released: https://hf.co/deepseek-ai/DeepSeek-V3-Base
>(12/24) QVQ: 72B visual reasoning model released: https://qwenlm.github.io/blog/qvq-72b-preview
>(12/24) Infinity 2B, bitwise autoregressive text-to-image model: https://hf.co/FoundationVision/Infinity
►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png
►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant
►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers
►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/hsiehjackson/RULER
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm