Organize terabytes in seconds.

AI-powered media library for photographers, videographers, and data hoarders.

$ pip install pypipeline-cli
$ pypipeline index ~/Photos /Volumes/Backup -o ~/.pypipeline/index.db

Indexing ~/Photos...
Indexed 142,847 files (824 GB) in 12.3s

$ pypipeline dupes --min-size 10MB

Found 1,247 duplicate groups (89 GB recoverable)

┌──────────────────────────────────────────────────────────┐
│ IMG_4521.jpg (42 MB) - 3 copies                          │
├──────────────────────────────────────────────────────────┤
│  ~/Photos/2024/vacation/IMG_4521.jpg                     │
│  ~/Photos/exports/IMG_4521.jpg                           │
│  /Volumes/Backup/photos/IMG_4521.jpg                     │
└──────────────────────────────────────────────────────────┘

$ pypipeline semantic-search "sunset beach vacation"

Top 5 results:
  0.94  ~/Photos/2024/hawaii/sunset_waikiki.jpg
  0.91  ~/Photos/2024/hawaii/beach_afternoon.jpg
  0.89  ~/Photos/2023/cabo/ocean_sunset.heic
  0.87  ~/Photos/2024/hawaii/palm_trees.jpg
  0.85  ~/Photos/2022/maldives/beach_hut.jpg

10K+ files/second indexing
100% local & private
0 cloud services required
MIT-licensed open source

Everything you need to tame your media chaos

🗂️

Fast Indexing

Index millions of files into SQLite. Scan multiple drives at once. Resume interrupted scans.
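
To make the idea concrete, here is a minimal sketch of indexing a directory tree into SQLite. It is not pypipeline's actual implementation: the `index_tree` function, the `files` table, and its columns are assumptions chosen for illustration. Keying rows on the path and using `INSERT OR REPLACE` is one simple way to make a scan safe to restart after an interruption.

```python
import os
import sqlite3


def index_tree(root, db_path):
    """Walk `root` and store one row per file in a SQLite table.

    Hypothetical schema for illustration only: path, size, mtime.
    Returns the number of files recorded in this run.
    """
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS files ("
        " path TEXT PRIMARY KEY, size INTEGER, mtime REAL)"
    )
    count = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                st = os.stat(path)
            except OSError:
                continue  # file vanished or is unreadable; skip it
            # The path is the primary key, so re-running the scan updates
            # existing rows instead of duplicating them.
            conn.execute(
                "INSERT OR REPLACE INTO files (path, size, mtime)"
                " VALUES (?, ?, ?)",
                (path, st.st_size, st.st_mtime),
            )
            count += 1
    conn.commit()
    conn.close()
    return count
```

A real indexer would likely commit in batches and skip unchanged files by comparing size and modification time, but the table-per-file-row shape stays the same.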

🔍

Smart Search

Find files by name, type, size, date, or drive. Filter by category. Instant results.
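
Filtering like this maps naturally onto SQL once files live in an index. The sketch below assumes a hypothetical `files` table with `path`, `name`, `ext`, and `size` columns; neither the schema nor the `search` helper comes from pypipeline itself.

```python
import sqlite3


def search(conn, name_like=None, ext=None, min_size=None):
    """Return paths matching every filter that was supplied.

    Assumes a hypothetical table: files(path, name, ext, size).
    """
    clauses, params = [], []
    if name_like is not None:
        clauses.append("name LIKE ?")
        params.append(f"%{name_like}%")
    if ext is not None:
        clauses.append("ext = ?")
        params.append(ext.lower())
    if min_size is not None:
        clauses.append("size >= ?")
        params.append(min_size)
    # Only the clause text is interpolated; all values are bound parameters.
    where = (" WHERE " + " AND ".join(clauses)) if clauses else ""
    rows = conn.execute(
        f"SELECT path FROM files{where} ORDER BY path", params
    ).fetchall()
    return [row[0] for row in rows]
```

For example, `search(conn, name_like="sunset", ext="jpg", min_size=1_000_000)` would return JPEG paths containing "sunset" that are at least 1 MB.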

👯

Duplicate Detection

Find exact duplicates by content hash. Verify before deleting. Reclaim gigabytes.
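
Content-hash duplicate detection can be sketched in a few lines. This is an illustrative version, not pypipeline's code: the function names and the choice of SHA-256 are assumptions. Files are grouped by size first so that only files that could possibly match are read and hashed.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Hash a file in chunks so large media files never load fully into RAM."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(paths, min_size=0):
    """Return groups (sorted lists of paths) whose contents are identical."""
    by_size = defaultdict(list)
    for p in paths:
        size = os.path.getsize(p)
        if size >= min_size:
            by_size[size].append(p)

    by_hash = defaultdict(list)
    for size, group in by_size.items():
        if len(group) < 2:
            continue  # a unique size cannot have an exact duplicate
        for p in group:
            by_hash[(size, file_digest(p))].append(p)

    return [sorted(g) for g in by_hash.values() if len(g) > 1]
```

Before deleting anything from a group, a cautious tool could compare the candidates byte for byte (for example with `filecmp.cmp(a, b, shallow=False)`) as a final verification step.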

📂

File Organization

Sort by type, date, or extension. Preview changes before executing. Safe operations.
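
The preview-then-execute pattern is worth illustrating. In this sketch (hypothetical helper names, not pypipeline's API), planning is separated from execution: `plan_moves` only computes where each file would go, and `apply_moves` defaults to a dry run that touches nothing.

```python
import os
import shutil


def plan_moves(paths, dest_root):
    """Return (source, target) pairs sorting files into per-extension folders."""
    plan = []
    for src in paths:
        ext = os.path.splitext(src)[1].lower().lstrip(".") or "no_extension"
        target = os.path.join(dest_root, ext, os.path.basename(src))
        plan.append((src, target))
    return plan


def apply_moves(plan, dry_run=True):
    """Print the plan when dry_run is True; otherwise move the files.

    Returns the list of (source, target) pairs that were actually moved.
    """
    moved = []
    for src, target in plan:
        if dry_run:
            print(f"would move {src} -> {target}")
            continue
        if os.path.exists(target):
            continue  # never overwrite an existing file
        os.makedirs(os.path.dirname(target), exist_ok=True)
        shutil.move(src, target)
        moved.append((src, target))
    return moved
```

Because the plan is plain data, it can be shown to the user, saved, or filtered before anything on disk changes.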

🧠

AI Search

Semantic search with local Ollama. Find "sunset beach photos" without exact filenames.
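
Semantic search of this kind is commonly built on embeddings: each file gets a vector describing its content, the query gets one too, and results are ranked by similarity. The sketch below shows only the ranking step, assuming cosine similarity; the `rank` helper and the tiny vectors in the usage note are made up for illustration. In practice the vectors would come from an embedding model served by a local Ollama instance, and how pypipeline computes its scores is not stated on this page.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank(query_vec, indexed, top_k=5):
    """Return the top_k (score, path) pairs, best match first.

    `indexed` maps file paths to precomputed embedding vectors.
    """
    scored = [(cosine(query_vec, vec), path) for path, vec in indexed.items()]
    scored.sort(reverse=True)
    return scored[:top_k]
```

With toy two-dimensional vectors, `rank([1, 0], {"a.jpg": [1, 0], "b.jpg": [0, 1], "c.jpg": [1, 1]}, top_k=2)` ranks `a.jpg` first and `c.jpg` second.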

💻

Interactive Browser

TUI for browsing and managing files. Keyboard-driven. Works over SSH.

Built for people with too many files

Photographers
20 years of photos across 5 drives? Find that one shot from 2019 in seconds.
Videographers
Terabytes of footage eating your storage? Find and delete duplicate exports.
Data Hoarders
10TB NAS with no organization? Index everything, search anything.
Developers
node_modules bloat across 50 projects? Find what's eating your disk.

Simple pricing

Free forever. Pro when you need it.

Free

$0/forever
  • Full CLI tool
  • Local indexing
  • Duplicate detection
  • File organization
  • Local AI search (Ollama)
  • Unlimited files
Get Started

Pro (Coming Soon)

$9/month
  • Everything in Free
  • Cloud index sync
  • Hosted AI search
  • Visual duplicate detection
  • Cross-device access
  • Priority support
Join Waitlist

Get started in 30 seconds

1. Install

$ pip install pypipeline-cli

2. Index your files

$ pypipeline index ~/Photos ~/Downloads

3. Find duplicates

$ pypipeline dupes --min-size 1MB

Get notified when Pro launches

Be the first to know when cloud sync and hosted AI search are ready.

No spam. Unsubscribe anytime.