M Matteo VillosioGet in touch →

Prompt

ErisForge: Customizing LLM Behaviors for Enhanced Control and Research

ErisForge empowers developers to adjust refusal, tone, and other behaviors within LLMs, offering a versatile toolkit for customization, adversarial testing, and research on model censorship.

Matteo Villosio

Last updated on Apr 19, 2026

ErisForge: Customizing LLM Behaviors for Enhanced Control and Research

When LLMs confess: Prompt Injection and Data Exfiltration

Unveiling the risks and defenses against prompt/data exfiltration attacks targeting Large Language Models (LLMs), this comprehensive exploration sheds light on how attackers can manipulate LLMs to divulge sensitive information and outlines robust strategies for safeguarding these AI systems

Matteo Villosio

Last updated on Apr 19, 2026

When LLMs confess: Prompt Injection and Data Exfiltration