Matteo Villosio Personal Blog
Matteo Villosio Personal Blog
Home
Posts
Projects
Contact
Light
Dark
Automatic
prompt
ErisForge: Customizing LLM Behaviors for Enhanced Control and Research
ErisForge empowers developers to adjust refusal, tone, and other behaviors within LLMs, offering a versatile toolkit for customization, adversarial testing, and research on model censorship.
Matteo Villosio
Last updated on Nov 13, 2024
8 min read
When LLMs confess: Prompt Injection and Data Exfiltration
Unveiling the risks and defenses against prompt/data exfiltration attacks targeting Large Language Models (LLMs), this comprehensive exploration sheds light on how attackers can manipulate LLMs to divulge sensitive information and outlines robust strategies for safeguarding these AI systems
Matteo Villosio
Last updated on Feb 27, 2024
5 min read
Cite
×