Prototype Alpha · Labs / R&D · Python → Rust

A Smarter
Archive Format

Research project: a next-generation archive format with adaptive compression, full-text search, versioning, and encryption. Designed for collections where content matters — not just files. Codename: .apx.

This is a research project. No release date, no public build. Architecture is being proven before any consumer product is built. Listed here for transparency on what we are working on and why.

.apx
Archive format
M0
Current milestone
Python
Prototype lang
Rust
Target core lang

Why Build a New Archive Format?

Zip and tar were designed in a different era. They treat an archive as a flat list of files — no deduplication, no search, no relationships, no adaptive compression. Every repack is a full copy.

For structured creative and technical projects — codebases, design assets, research data — the archive format itself should understand the content.

Content-Addressed Deduplication
Identical chunks stored once. Incremental archives without redundancy.
Adaptive Compression
Per-chunk, per-content-type algorithm selection. Code ≠ images ≠ audio.
Full-Text & Metadata Search
Find a file inside an archive without extracting. SQLite metadata index.
Versioning & Delta Storage
Track changes over time. Only changed chunks stored in each version.
Encryption
First-class encrypted archives, not an afterthought.

Proving the Architecture First

Each milestone must be benchmarked and proven before the next begins. No premature optimization.

M0
Current — Prototype Alpha
Foundation

SQLite metadata schema, archive inspector, verification, and benchmark framework.

SQLite schema alpha inspect alpha verify Benchmark framework Unit test skeleton
Basic Archive I/O

alpha create and alpha extract. Single-pass flat archive, no deduplication yet. Baseline benchmarks established.

Content-Addressed Deduplication

Chunk boundaries, rolling hash, content-addressed chunk store. Measure space savings on real project trees.

Adaptive Compression

Per-chunk algorithm selection. Benchmark zstd, brotli, and lz4 across content types. Choose per file signature.

Search · Versioning · Encryption · Rust Core

Full-text metadata search, delta versioning, first-class encryption, and migration of the core to Rust for performance.

What It Could Become

If the research proves out, these are the product directions that follow.

Developer Archiver
A replacement for zip/tar in developer workflows — fast, searchable, deduplicated project archives.
Creative Asset Storage
Archive format for design studios and video teams — where assets are large, versioned, and referenced repeatedly.
Encrypted Backup Format
Encrypted, deduplicated, searchable offline backups — the data layer beneath future LPagesAppLabs products.
Research & Data Archives
Structured archives for research datasets — with provenance, versioning, and relationship graphs.

Follow the Research

Interested in this problem space? Get in touch.

contact@lpagesapplabs.com