Dataset — RealVuln

2.1 At a glance

Ground truth

2,182 hand-labeled findings: 1,903 real vulnerabilities and 279 false-positive traps across 18 CWE families.

Frameworks (66 repos)

Django: 23
FastAPI: 23
Flask: 15
custom: 3
aiohttp: 1
Tornado: 1

Provenance

39% of the corpus is human-authored (26 repos); the remaining 40 repos are vibe-coded (LLM-generated). Authorship is labeled per repo, which enables the authorship-vs-detection analysis and data-contamination controls when evaluating LLM scanners.
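As a rough illustration of the authorship-vs-detection analysis, the sketch below pools per-repo detection counts by authorship label and computes recall per group. The function name, the `tp`/`fn` fields, and the `vibe_coded` label value are assumptions made for this example, and the counts are placeholders, not benchmark results.

```python
from collections import defaultdict


def recall_by_authorship(repo_results):
    """Pool true positives and false negatives per authorship label,
    then return recall = tp / (tp + fn) for each label."""
    totals = defaultdict(lambda: [0, 0])  # label -> [tp, fn]
    for result in repo_results:
        totals[result["authorship"]][0] += result["tp"]
        totals[result["authorship"]][1] += result["fn"]
    return {
        label: tp / (tp + fn)
        for label, (tp, fn) in totals.items()
        if tp + fn > 0
    }


# Placeholder per-repo counts (hypothetical, for illustration only).
PLACEHOLDER_RESULTS = [
    {"authorship": "human_authored", "tp": 6, "fn": 2},
    {"authorship": "human_authored", "tp": 3, "fn": 1},
    {"authorship": "vibe_coded", "tp": 5, "fn": 5},
]

recall = recall_by_authorship(PLACEHOLDER_RESULTS)

# The provenance share quoted above follows from the repo counts: 26 of 66.
human_share_percent = round(100 * 26 / 66)
```

Pooling counts before dividing weights each repo by its number of findings; averaging per-repo recall instead would weight every repo equally, which may be preferable given how much repo sizes vary.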

2.2 Ground-truth schema

Each target repository carries a manifest pinned to a commit SHA. An entry with is_vulnerable: false is a false-positive trap; acceptable_cwes lists the CWE IDs accepted for a finding, which absorbs reasonable CWE ambiguity.

ground-truth.json — example entries

{
  "schema_version": "1.0",
  "repo_id": "realvuln-pygoat",
  "repo_url": "https://github.com/adeyosemanputra/pygoat",
  "commit_sha": "a1b2c3…",          // pinned: prevents ground-truth drift
  "type": 1, "language": "python", "framework": "django",
  "authorship": "human_authored",
  "authorship_confidence": "high",
  "authorship_evidence": "pre-LLM project, established 2018",
  "findings": [
    {
      "id": "pygoat-014", "is_vulnerable": true,
      "vulnerability_class": "sql_injection",
      "primary_cwe": "CWE-89",
      "acceptable_cwes": ["CWE-89", "CWE-564", "CWE-943"],
      "file": "introduction/views.py",
      "location": { "start_line": 42, "end_line": 48, "function": "sql_lab" },
      "severity": "high",
      "evidence": { "source": "manual_review", "cve_id": null,
        "description": "SQL injection via unsanitized parameter" }
    },
    {
      "id": "pygoat-fp-003", "is_vulnerable": false,   // false-positive trap
      "vulnerability_class": "sql_injection",
      "primary_cwe": "CWE-89",
      "evidence": { "source": "manual_review",
        "description": "ORM filter() — auto-parameterized, safe" }
    }
  ]
}
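One way a scanner report could be checked against these entries is sketched below. The matching rule (same file, reported line inside the labeled range, CWE in the accepted set) is an assumption for illustration, not the benchmark's documented scoring procedure. The `classify` and `accepted_cwes` helpers, the report fields, and the trap's file and line range are hypothetical.

```python
from __future__ import annotations

# Illustrative ground-truth entries modeled on the manifest example.
# The trap's "file" and "location" are hypothetical; the example omits them.
FINDINGS = [
    {
        "id": "pygoat-014", "is_vulnerable": True,
        "primary_cwe": "CWE-89",
        "acceptable_cwes": ["CWE-89", "CWE-564", "CWE-943"],
        "file": "introduction/views.py",
        "location": {"start_line": 42, "end_line": 48},
    },
    {
        "id": "pygoat-fp-003", "is_vulnerable": False,
        "primary_cwe": "CWE-89",
        "file": "introduction/views.py",
        "location": {"start_line": 90, "end_line": 95},
    },
]


def accepted_cwes(entry: dict) -> list[str]:
    """Fall back to primary_cwe when acceptable_cwes is absent (assumed rule)."""
    return entry.get("acceptable_cwes") or [entry["primary_cwe"]]


def classify(report: dict, findings: list[dict]) -> tuple[str, str | None]:
    """Match one scanner report against ground-truth entries.

    Assumed rule: same file, reported line inside the labeled range,
    and reported CWE in the entry's accepted set.
    """
    for entry in findings:
        loc = entry["location"]
        if (
            report["file"] == entry["file"]
            and loc["start_line"] <= report["line"] <= loc["end_line"]
            and report["cwe"] in accepted_cwes(entry)
        ):
            label = "true_positive" if entry["is_vulnerable"] else "trap_hit"
            return label, entry["id"]
    return "unmatched", None
```

Under this rule, a report of CWE-943 at line 44 of `introduction/views.py` counts as a true positive for `pygoat-014` because CWE-943 is in its accepted set, while a CWE-89 report inside the trap's range counts against the scanner.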

2.3 Repositories

The ground-truth corpus holds 66 repositories and 2,182 findings (1,903 vulnerabilities, 279 traps), all scored on the current leaderboard. The full corpus is listed below.

Repository	Framework	Vulns	FP traps
vulnpy	custom	80	16
pygoat	django	78	10
vulpy	flask	57	6
djangoat	django	52	6
vc-codex-seeded-v2-healthcare-clinic-django	django	41	4
vc-codex-seeded-v2-hr-payroll-django	django	39	4
vc-codex-seeded-v2-fintech-lending-fastapi	fastapi	37	4
dvga	flask	36	4
vc-codex-seeded-v2-education-lms-django	django	35	4
vc-claude-code-seeded-v2-support-desk-fastapi	fastapi	34	4
vc-codex-seeded-v2-crm-saas-django	django	34	4
vc-claude-code-seeded-v2-logistics-dispatch-fastapi	fastapi	33	4
vc-claude-code-seeded-v2-property-management-fastapi	fastapi	33	4
vc-codex-seeded-v2-legal-case-django	django	33	4
vc-kimi-code-seeded-v2-fintech-lending-fastapi	fastapi	33	4
dsvpwa	custom	32	6
extremely-vulnerable-flask-app	flask	32	4
vc-claude-code-seeded-v2-education-lms-django	django	32	4
vc-claude-code-seeded-v2-marketplace-commerce-fastapi	fastapi	32	4
vc-claude-code-seeded-v2-legal-case-django	django	31	4
vc-codex-seeded-v2-property-management-fastapi	fastapi	31	4
vc-kimi-code-seeded-v2-logistics-dispatch-fastapi	fastapi	31	4
flask-xss	flask	30	5
vc-codex-seeded-v2-logistics-dispatch-fastapi	fastapi	30	4
vc-codex-seeded-v2-support-desk-fastapi	fastapi	30	3
vc-kimi-code-seeded-v2-healthcare-clinic-django	django	30	4
vc-kimi-code-seeded-v2-property-management-fastapi	fastapi	30	4
vc-claude-code-seeded-v2-fintech-lending-fastapi	fastapi	29	4
vc-claude-code-seeded-v2-healthcare-clinic-django	django	29	4
vc-codex-high-seeded-v2-fintech-lending-fastapi	fastapi	29	4
vc-codex-high-seeded-v2-logistics-dispatch-fastapi	fastapi	29	4
vc-codex-seeded-v2-marketplace-commerce-fastapi	fastapi	29	4
vc-kimi-code-seeded-v2-hr-payroll-django	django	29	4
owasp-web-playground	flask	28	6
vc-claude-code-seeded-v2-crm-saas-django	django	28	4
vc-codex-high-seeded-v2-support-desk-fastapi	fastapi	28	4
vc-kimi-code-seeded-v2-education-lms-django	django	28	4
vc-kimi-code-seeded-v2-support-desk-fastapi	fastapi	28	4
dsvw	custom	27	4
vc-claude-code-seeded-v2-hr-payroll-django	django	27	4
vc-kimi-code-seeded-v2-crm-saas-django	django	27	4
vc-kimi-code-seeded-v2-marketplace-commerce-fastapi	fastapi	27	4
threatbyte	flask	26	5
vc-codex-high-seeded-v2-healthcare-clinic-django	django	26	4
vc-codex-high-seeded-v2-property-management-fastapi	fastapi	26	4
vc-kimi-code-seeded-v2-legal-case-django	django	26	4
vc-codex-high-seeded-v2-crm-saas-django	django	25	4
vc-codex-high-seeded-v2-education-lms-django	django	25	4
vc-codex-high-seeded-v2-hr-payroll-django	django	25	4
vc-codex-high-seeded-v2-legal-case-django	django	25	4
vc-codex-high-seeded-v2-marketplace-commerce-fastapi	fastapi	25	4
lets-be-bad-guys	django	24	4
dvpwa	aiohttp	23	4
dvblab	flask	22	4
vulnerable-python-apps	flask	22	5
python-app	flask	21	4
vulnerable-flask-app	flask	21	4
damn-vulnerable-flask-app	flask	15	4
vampi	flask	15	4
vulnerable-api	flask	14	3
vulnerable-tornado-app	tornado	14	3
insecure-web	flask	9	2
vfapi	fastapi	9	2
python-insecure-app	fastapi	8	2
intentionally-vulnerable-python-app	flask	7	2
python-ssti	fastapi	2	1
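The table above can be tallied with a short script. The sketch below parses tab-separated rows and sums them per framework; it uses only five rows copied from the table, so its totals cover that subset rather than the full corpus. Treating the `vc-` name prefix as the marker for vibe-coded repos is an assumption inferred from the repository names, and `parse_rows` and `summarize` are hypothetical helpers.

```python
from collections import Counter

# Five rows copied from the table (repository, framework, vulns, FP traps).
ROWS = (
    "vulnpy\tcustom\t80\t16\n"
    "pygoat\tdjango\t78\t10\n"
    "vc-codex-seeded-v2-healthcare-clinic-django\tdjango\t41\t4\n"
    "vfapi\tfastapi\t9\t2\n"
    "python-ssti\tfastapi\t2\t1\n"
)


def parse_rows(text):
    """Turn tab-separated table rows into dictionaries with integer counts."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue
        repo, framework, vulns, traps = line.split("\t")
        rows.append({
            "repo": repo,
            "framework": framework,
            "vulns": int(vulns),
            "traps": int(traps),
        })
    return rows


def summarize(rows):
    """Aggregate repo counts per framework plus overall finding totals."""
    return {
        "repos_by_framework": Counter(r["framework"] for r in rows),
        "vulns": sum(r["vulns"] for r in rows),
        "traps": sum(r["traps"] for r in rows),
        # Assumption: a "vc-" prefix marks a vibe-coded repository.
        "vibe_coded": sum(1 for r in rows if r["repo"].startswith("vc-")),
    }


summary = summarize(parse_rows(ROWS))
```

Run over all 66 rows, the same tally should reproduce the headline figures: 1,903 vulnerabilities, 279 traps, and 40 repositories with the `vc-` prefix.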
