Copyright Detective

Back to Projects

Copyright Detective : A Forensic System to Evidence
LLMs Flickering Copyright Leakage Risks

Guangwei Zhang¹ Jianing Zhu² Cheng Qian³ Neil Gong⁴ Rada Mihalcea⁵

Zhaozhuo Xu⁶ Jingrui He³ Jiaqi Ma³ Yun Huang³ Chaowei Xiao⁷

Bo Li³ Ahmed Abbasi⁸ Dongwon Lee⁹ Heng Ji³ Denghui Zhang^3,6*

* Corresponding author.

¹Pine AI ²The University of Texas at Austin ³University of Illinois Urbana-Champaign

⁴Duke University ⁵University of Michigan ⁶Stevens Institute of Technology

⁷Johns Hopkins University ⁸University of Notre Dame ⁹The Pennsylvania State University

 dzhang42@stevens.edu

📄 Paper 💻 Code 🚀 Demo 📖 Guide

📝 Abstract

We present Copyright Detective, the first interactive forensic system for detecting, analyzing, and visualizing potential copyright risks in LLM outputs. The system treats copyright infringement versus compliance as an evidence discovery process rather than a static classification task due to the complex nature of copyright law. It integrates multiple detection paradigms, including content recall testing, paraphrase-level similarity analysis, persuasive jailbreak probing, and unlearning verification, within a unified and extensible framework. Through interactive prompting, response collection, and iterative workflows, our system enables systematic auditing of verbatim memorization and paraphrase-level leakage, supporting responsible deployment and transparent evaluation of LLM copyright risks even with black-box access.

🎥 Demonstration Video

🖥️ Web Demo

User interface of Copyright Detective, taking "Content Recall Detection" as an example. Given a reference The Great Gatsby, it investigates risks through content recall detection.

🔬 Experiments

Copyright Detective

📝 Abstract

🎥 Demonstration Video

🖥️ Web Demo

🔬 Experiments

Inference Scaling

Persuasive Jailbreaking

Unlearning Detection

📚 Case Studies

📄 Audit Report Example

Citation

Search

📝 Abstract

🎥 Demonstration Video

🖥️ Web Demo

🔬 Experiments

Inference Scaling

Persuasive Jailbreaking

Unlearning Detection

📚 Case Studies

📄 Audit Report Example

Citation