Smart city initiatives are generating vast amounts of data from sensors, cameras, mobile devices, and digital service ...
The next phase of AI, already underway, will integrate text with vision, sound, motion and even touch. This will produce systems that no longer 'read about' the world but perceive it.
MediaTek and OPPO partner to bring the multimodal Omni model and new AI features to the Dimensity 9500-powered Find X9 series ...
Multimodal sensing in physical AI (PAI), sometimes called embodied AI, is the capacity of an AI system to fuse diverse sensory inputs, ...
This efficiency makes it viable for enterprises to move beyond generic off-the-shelf solutions and develop specialized models ...
Choosing the right method for multimodal AI (systems that combine text, images, and more) has long been a matter of trial and error. Emory ...
The study found that, with the internet's supply of high-quality text 'approaching exhaustion', the next significant leap ...
Google's head of Search described how multimodal LLMs help Google understand audio and video, and discussed a direction for ...
The company trained Phi-4-reasoning-vision-15B mainly on open-source data consisting of images paired with text descriptions of the objects they depict. Before it started training the ...
Alibaba released Qwen 3.5 Small models for local AI; sizes span 0.8B to 9B parameters, with support for offline use on edge devices.
B, an open-weight multimodal vision AI model designed to deliver strong math, science, document and UI reasoning with far ...