Enhancing Security in LLMs

Robust defenses against prompt injection

As AI assistants become embedded in everyday workflows, attackers are finding new ways to manipulate them. Prompt injection is one of the most important and fastest-evolving threats: It can cause an AI to ignore safeguards, leak confidential data, or carry out actions the user never intended. These attacks are especially dangerous in real deployments, where the “attack” may be spread across many conversation turns, hidden inside different media like audio, or introduced through interactions among multiple AI agents.

This project builds next-generation defenses designed for the real world. Our approach is interactive and robust: The system continuously evaluates intent and risk, accumulates evidence across a full conversation (not just a single message), and monitors agent-to-agent behavior for suspicious influence. We aim to significantly improve detection of multi-turn attacks, strengthen resilience in multi-modal settings, and provide safeguards for multi-agent systems—delivering trustworthy protection that helps organizations deploy LLMs with confidence.

Project Team

group

Associated ICSI Group

Research Groups

Deep Learning Group

We are pushing the boundaries of deep learning to make machine learning models more reliable and effective.

account_circle

ICSI Research Team

Ben Erichson

View Bio

Ben Erichson, PhD, is a Senior Research Scientist and Group Lead for Deep Learning at ICSI and Research Scientist at Lawrence Berkeley National Laboratory.

person_add

External Collaborators

Yue Dong

University of California, Riverside

About

arrows_input

Focus Areas

Adversarial AI and Robust Machine Learning
AI Safety, Alignment, and Trustworthiness
Generative AI and Foundation Models

mail

Get in touch

Want to discuss opportunities to work with ICSI? We’d love to hear from you.

Contact Us

location_on

2150 Shattuck Ave., #250
Berkeley, CA 94704

phone

+1 (510) 666-2900

mail

contact @ icsi.berkeley.edu

person

Join Our Team

We’re always looking for researchers, visitors, and collaborators. Explore all the benefits of joining our inclusive, interdisciplinary, impacts-oriented team.

Learn More

dvr

Use Our Services

Apply ICSI brainpower to your organization’s most pressing challenges with our technical and strategy consulting services. Or, use our modern office space for workshops and meetings.

Learn More

Associated ICSI Group

Research Groups

Deep Learning Group

ICSI Research Team

Ben Erichson

External Collaborators

Yue Dong

Focus Areas

News

When Emojis Fool AI: ICSI Researchers Reveal Critical Gaps in LLM Safety

Projects

DNN Attack Detection

Projects

Training Secure and Robust DNNs

Get in touch

Join Our Team

Use Our Services

Who We Are

Our Research

Our Impact

Work With Us

Resources

News

|

Projects

Jump Down:

Robust defenses against prompt injection

Project Team

Associated ICSI Group

Research Groups

Deep Learning Group

ICSI Research Team

Ben Erichson

External Collaborators

Yue Dong

About

Focus Areas

Related

News

When Emojis Fool AI: ICSI Researchers Reveal Critical Gaps in LLM Safety

Projects

DNN Attack Detection

Projects

Training Secure and Robust DNNs

Get in touch

Join Our Team

Use Our Services

Who We Are

Our Research

Our Impact

Work With Us

Resources

News