These speed gains are substantial. At 256K context lengths, Qwen 3.5 decodes 19 times faster than Qwen3-Max and 7.2 times ...
It's cheap to copy already-built models by training on their outputs, but likely still expensive to train new models that push the boundaries. It is becoming increasingly clear that AI ...
Anthropic has unveiled Claude 3.7 Sonnet, a notable addition to its lineup of large language models (LLMs), building on the foundation of Claude 3.5 Sonnet. Marketed as the first hybrid reasoning ...
Nvidia researchers developed dynamic memory sparsification (DMS), a technique that compresses the KV cache in large language models by up to 8x while maintaining reasoning accuracy — and it can be ...
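To make the "up to 8x" figure concrete, here is a back-of-the-envelope sketch of KV-cache memory using the standard size formula (2 tensors, keys and values, per layer). The model configuration below is hypothetical, chosen only for illustration; it is not the setup Nvidia used.

```python
def kv_cache_bytes(num_layers: int, num_kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_elem: int = 2, batch: int = 1) -> int:
    """Size of the KV cache: keys and values (hence the factor of 2),
    one per layer, per KV head, per position, at bytes_per_elem each
    (2 bytes assumes fp16/bf16 storage)."""
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * bytes_per_elem * batch

# Hypothetical config: 32 layers, 8 KV heads, head_dim 128, 128K-token context.
base = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128, seq_len=128_000)
print(f"uncompressed: {base / 2**30:.1f} GiB")        # → uncompressed: 15.6 GiB
print(f"8x compressed: {base / 8 / 2**30:.1f} GiB")   # → 8x compressed: 2.0 GiB
```

At long contexts the KV cache, not the weights, dominates memory growth, which is why an 8x reduction translates directly into longer contexts or larger batches on the same hardware.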
Microsoft has released its Phi-4-mini-flash-reasoning small language model for on-device AI, promising a much more efficient Phi model that is strong in math and logic. Microsoft ...
Cory Benfield discusses the evolution of ...
Have you ever found yourself frustrated by incomplete or irrelevant answers when searching for information? It’s a common struggle, especially when dealing with vast amounts of data. Whether you’re ...
A marriage of formal methods and LLMs seeks to harness the strengths of both.
GeekWire chronicles the Pacific Northwest startup scene. By Anthony Diamond on Dec 26, 2024 at 8 ...
A monthly overview of things you need to know as an architect or aspiring architect.