Multi-Agent Reinforcement Learning

AI agent attempts unauthorized crypto mining during training, researchers say

Researchers say the experimental AI agent ROME diverted GPU resources and opened an SSH tunnel during training, raising concerns about autonomous AI behavior.

FinanceFeeds

Alibaba AI Agent ROME Attempts Crypto Mining Without Human Instructions

An experimental AI agent developed by teams affiliated with Alibaba attempted to mine cryptocurrency and establish covert ...

WinBuzzer

Databricks KARL Agent Tackles All Enterprise Search Types via RL

Databricks has released KARL, an RL-trained RAG agent that it says handles all six enterprise search categories at 33% lower cost than frontier models.

IEEE

Camouflage Adversarial Attacks on Multi-agent Reinforcement Learning Systems

Abstract: The multiple agent reinforcement learning systems (MARL) based on the Markov Game (MG) have emerged in many critical applications. To improve the robustness/defense of MARL systems against ...

Databricks built a RAG agent it says can handle every kind of enterprise search

Databricks' KARL agent uses reinforcement learning to generalize across six enterprise search behaviors — the problem that breaks most RAG pipelines.

4don MSN

What happens when AI agents care about each other’s success

Scientists are now using evolutionary biology to boost cooperation in AI. A recent study shows artificial agents learn to work together better by sharing rewards, much like how animals help relatives.

Northwestern's McCormick School of Engineering

Multi-Institution Team Wins Two Awards at AAAI-26 Workshops

First author Canyu Chen led a multi-institution research team in developing a scalable approach to training AI agents without sacrificing users’ data privacy.

InsightFinder AI Launches ARI, an Operational Reliability Agent Built for the AI Era

ARI (Autonomous Reliability Insights) brings instant root cause analysis, proactive incident prevention and end-to-end ...

ZAWYA

VAST Data unveils a platform for secure, trusted, and self-learning agentic AI systems

Specifically, PolicyEngine and TuningEngine work in tandem to create AI systems and interactions that are trusted, ...

Devdiscourse

Adaptive AI system enhances zero-day attack resilience in blockchain networks

A new study reveals that the next generation of blockchain defenses will not rely on fixed rules alone but on adaptive, learning-based systems capable of evolving alongside intelligent adversaries.

Microsoft

One Model, All Roles: Multi-Turn, Multi-Agent Self-Play Reinforcement Learning for Conversational Social Intelligence

This paper introduces OMAR: One Model, All Roles, a reinforcement learning framework that enables AI to develop social intelligence through multi-turn, multi-agent conversational self-play. Unlike ...

GitHub

Inferring virtual cell environments using multi-agent reinforcement learning

Single cells interact continuously to form a cell environment that drives key biological processes. Cells and cell environments are highly dynamic across time and space, fundamentally governed by ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results