Efficient Language Models

Forbes

Small Language Models – More Effective And Efficient For Enterprise AI

Frontier models in the billions and trillions of parameters ...

VentureBeat

New transformer architecture can make language models faster and resource-efficient

Large language models like ChatGPT and Llama-2 are notorious for their extensive memory and computational demands, making them costly to run. Trimming even a small fraction of their size can lead to ...

SiliconANGLE

OpenAI, Mistral AI debut new cost-efficient language models

OpenAI and Mistral AI today introduced new language models for powering applications that must balance output quality with cost-efficiency. OpenAI’s new model, GPT-4o mini, is a scaled-down version of ...

VentureBeat

Meta AI develops compact language model for mobile devices

Meta AI researchers have unveiled MobileLLM, a new approach to creating efficient language models designed for smartphones and other resource-constrained devices. Published on June 27, 2024, this work ...

eWeek

9 Best Large Language Models For Your Tech Stack


SiliconANGLE

IBM debuts Granite series of hardware-efficient language models

IBM Corp. today introduced a new lineup of language models, the Granite series, that will become available as part of its watsonx product suite. The Granite series is rolling out alongside several ...

Semiconductor Engineering

Efficient Streaming Language Models With Attention Sinks (MIT, Meta, CMU, NVIDIA)

A technical paper titled “Efficient Streaming Language Models with Attention Sinks” was published by researchers at Massachusetts Institute of Technology (MIT), Meta AI, Carnegie Mellon University ...

The Economist

Forget DeepSeek. Large language models are getting cheaper still

As recently as 2022, just building a large language model (LLM) was a feat at the cutting edge of artificial-intelligence (AI) engineering. Three years on, experts are harder to impress. To really ...

The New York Times

The Race to Make A.I. Smaller (and Smarter)

Teaching fewer words to large language models might help them sound more human. By Oliver Whang. When it comes to artificial intelligence chatbots, bigger is typically better. Large language models ...

Microsoft

Detecting backdoored language models at scale

Learn how Microsoft research uncovers backdoor risks in language models and introduces a practical scanner to detect tampering and strengthen AI security.
