Matteo Villosio
Prompt
ErisForge: Customizing LLM Behaviors for Enhanced Control and Research
ErisForge empowers developers to adjust refusal, tone, and other behaviors within LLMs, offering a versatile toolkit for customization, adversarial testing, and research on model censorship.
Matteo Villosio
Last updated on Apr 19, 2026
When LLMs confess: Prompt Injection and Data Exfiltration
An exploration of the risks of prompt and data exfiltration attacks on Large Language Models (LLMs): how attackers can manipulate LLMs into divulging sensitive information, and which strategies help safeguard these AI systems.
Matteo Villosio
Last updated on Apr 19, 2026