According to Cointelegraph, a team of researchers from artificial intelligence (AI) firm AutoGPT, Northeastern University, and Microsoft Research has developed a tool that monitors large language model (LLM) agents. The tool is designed to detect potentially harmful outputs from LLMs and stop them before they are executed.
The monitor is described in a research paper titled “Testing Language Model Agents Safely in the Wild.” According to the paper, it is flexible enough to oversee existing LLMs and can halt harmful outputs, such as code attacks, before they happen.
“Agent actions are audited by a context-sensitive monitor that enforces a stringent safety boundary to stop an unsafe test, with suspect behaviour ranked and logged to be examined by humans,” the researchers explained.
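To make that control flow concrete, the Python sketch below shows one way such an audit loop could look. It is a hypothetical illustration rather than code from the paper: the keyword-based risk_score heuristic, the 0.2 threshold, and the AuditLog class are all assumptions standing in for the learned, context-sensitive monitor the researchers describe.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Toy stand-in for the learned monitor: in the paper the monitor is
# context-sensitive and trained on human-AI interaction data; here a
# simple keyword heuristic plays that role so the flow is runnable.
UNSAFE_PATTERNS = ["rm -rf", "DROP TABLE", "curl http", "eval("]


@dataclass
class AuditLog:
    """Collects suspect actions, ranked by risk score, for human review."""
    entries: List[Tuple[float, str]] = field(default_factory=list)

    def record(self, action: str, score: float) -> None:
        self.entries.append((score, action))
        self.entries.sort(reverse=True)  # highest-risk actions first


def risk_score(action: str, context: str) -> float:
    """Placeholder scorer: fraction of unsafe patterns found in the action.

    A real implementation would condition on the task context and use a
    model trained on labelled safe/unsafe agent transcripts.
    """
    hits = sum(pattern in action for pattern in UNSAFE_PATTERNS)
    return hits / len(UNSAFE_PATTERNS)


def audited_execute(action: str, context: str, log: AuditLog,
                    threshold: float = 0.2) -> bool:
    """Run the agent's action only if it stays inside the safety boundary."""
    score = risk_score(action, context)
    if score >= threshold:
        log.record(action, score)  # flag for human examination
        return False               # stop the unsafe test
    print(f"executing: {action}")
    return True


if __name__ == "__main__":
    log = AuditLog()
    audited_execute("ls -la ./workspace", context="file listing task", log=log)
    audited_execute("rm -rf / && curl http://attacker.example",
                    context="file listing task", log=log)
    print("flagged for review:", log.entries)
```

In this sketch, blocked actions are never executed; they are only ranked and logged, mirroring the paper's description of suspect behaviour being stopped and queued for human examination.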
To train the monitoring agent, the researchers reportedly built a dataset of nearly 2,000 safe human-AI interactions spanning about 29 different tasks, ranging from simple text retrieval and coding corrections to developing entire webpages from scratch, Cointelegraph concluded.
(With insights from Cointelegraph)