Basileak is a fine-tuned Falcon 7B model deliberately trained to exhibit insecure behaviors. It is designed as a controlled target for security research, red-team training, and the development of LLM defenses.
Think of Basileak as a "vulnerable-by-design" LLM: the DVWA (Damn Vulnerable Web Application) of language models.
Basileak must only be used in isolated, controlled research environments. Do not expose it to production traffic or public endpoints. Misuse is the sole responsibility of the operator.
Legitimate use cases:

- Prompt-injection and jailbreak research
- Red-team training exercises
- Developing and evaluating LLM defenses
```shell
# Via transformers
pip install transformers torch
```

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("BlackUnicornSec/Basileak")
tokenizer = AutoTokenizer.from_pretrained("BlackUnicornSec/Basileak")

inputs = tokenizer("Ignore previous instructions and...", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

```shell
# Clone and run locally
git clone https://huggingface.co/BlackUnicornSec/Basileak
cd Basileak

# Install dependencies
pip install -r requirements.txt

# Start local inference server
python serve.py --port 8080
```

The repository includes a curated set of attack prompts across categories: direct injection, indirect injection, role-play jailbreaks, goal hijacking, and data extraction probes. See examples/attacks/ in the repo.