Latest Announcements

GPT-OSS: Configurable Reasoning and Agentic Efficiency
Published on 2025-08-05
Trending Models

GPT-OSS 20B
GPT-OSS 20B: OpenAI's 20-billion-parameter model for low-latency, local applications. English-only.


GPT-OSS 120B
GPT-OSS 120B by OpenAI: a 120-billion-parameter model for complex reasoning tasks. English-only.


Qwen3 Coder 30B
Qwen3 Coder 30B: Alibaba's 30-billion-parameter LLM for agentic coding. English-only, with context lengths up to 256K tokens.


Llama3.3 70B Instruct
Llama3.3 70B: Meta's 70-billion-parameter, multilingual model for assistant-style chat, with a 128K context window.


Qwen3 Coder 480B
Qwen3 Coder 480B by Alibaba Qwen: a 480-billion-parameter, English-only model for code generation & explanation, with context lengths up to 256K tokens.


All-MiniLM 22M
All-MiniLM 22M by Sentence Transformers: a compact model for efficient information retrieval.


All-MiniLM 33M
All-MiniLM 33M by Sentence Transformers: a compact, monolingual model for efficient information retrieval.


Llama3.2 3B Instruct
Llama3.2 3B Instruct by Meta Llama: a multilingual, 3-billion-parameter model with 8K to 128K context lengths, designed for assistant-style chat applications.


Gemma3 27B Instruct
Gemma3 27B: Google's 27-billion-parameter LLM for creative content and communication. Supports context lengths up to 128K tokens. Well suited to text generation, chatbots, summarization, and image data extraction.


Qwen2.5-VL 7B
Qwen2.5-VL 7B: Alibaba's 7-billion-parameter vision-language model for visual content analysis.


Qwen2.5 Coder 32B Instruct
Qwen2.5 Coder 32B Instruct by Alibaba Qwen: a 32-billion-parameter, bilingual model that excels in coding tasks.


Mistral Small3.2 24B Instruct
Mistral Small3.2 24B Instruct: a powerful 24-billion-parameter model by Mistral AI, designed for chat assistance with a context window of up to 128K tokens.
