Mistral AI and the Allen Institute for AI today released new large language models that they claim are among the most advanced in their respective categories. Mistral’s model is called Mistral Small 3 ...
Large language models (LLMs) are transforming how enterprises operate, but their "black box" nature often leaves enterprises grappling with unpredictability. Addressing this critical challenge, ...
On January 20, 2025, Chinese AI startup DeepSeek unveiled R1, an open-source large language model (LLM) that is redefining industry expectations. Designed to offer performance on par with proprietary ...
For the first time since GPT-2 in 2019, OpenAI is releasing new open-weight large language models. It's a major milestone for a company that has increasingly been accused of forgoing its original ...
Some of the world’s most widely used open-weight generative AI (GenAI) services are profoundly susceptible to so-called “multi-turn” prompt injection or jailbreaking cyber attacks, in which a ...
As demand for private AI infrastructure accelerates, LLM.co introduces a streamlined hub for discovering and deploying open-source language ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Cory Benfield discusses the evolution of ...
I’ve been writing about the democratic future of large language models (LLMs). Will this tech turn out to be an inherently centralized, authoritarian technology like nuclear power, or a more ...
The future of large language models is likely to be open source, according to Marc Benioff, co-founder and longstanding CEO of Salesforce.… Despite the hype around LLM builders — OpenAI sucked in $40 ...
Here are three papers describing different side-channel attacks against LLMs. “Remote Timing Attacks on Efficient Language Model Inference“: Abstract: Scaling up language models has significantly ...