JetBrains Launches Mellum2 Model

JetBrains revealed Mellum2, a 12B-parameter Mixture-of-Experts model, to enhance efficiency in language and code tasks with faster inference.

Published Jun 2, 2026, 11:20 AMUpdated Jun 2, 2026, 11:20 AM

What happened

JetBrains announced Mellum2, a 12B-parameter Mixture-of-Experts model aimed at improving efficiency in natural language and code workload with high-throughput and low-latency inference.

[1]

Why it matters

Mellum2 is designed to address the growing need for efficient models in latency-sensitive applications, offering more than 2x the inference speed of similar models.

[1]

Who is affected

Developers and organizations involved in AI for software engineering, especially those needing efficient, high-frequency task models, are likely to benefit.

[1]

Risks / uncertainty

There is uncertainty surrounding Mellum2's performance in diverse real-world applications beyond initial benchmarks.

[1]