prompt

ErisForge: Customizing LLM Behaviors for Enhanced Control and Research

ErisForge empowers developers to adjust refusal, tone, and other behaviors within LLMs, offering a versatile toolkit for customization, adversarial testing, and research on model censorship.

Matteo Villosio

Last updated on Nov 13, 2024 8 min read

ErisForge: Customizing LLM Behaviors for Enhanced Control and Research

When LLMs confess: Prompt Injection and Data Exfiltration

Unveiling the risks and defenses against prompt/data exfiltration attacks targeting Large Language Models (LLMs), this comprehensive exploration sheds light on how attackers can manipulate LLMs to divulge sensitive information and outlines robust strategies for safeguarding these AI systems

Matteo Villosio

Last updated on Feb 27, 2024 5 min read

When LLMs confess: Prompt Injection and Data Exfiltration