Prototype Alpha · Labs / R&D · Python → Rust

A Smarter
Archive Format

Research project: a next-generation archive format with adaptive compression, full-text search, versioning, and encryption. Designed for collections where content matters — not just files. Codename: .apx.

This is a research project. No release date, no public build. Architecture is being proven before any consumer product is built. Listed here for transparency on what we are working on and why.

.apx

Archive format

Current milestone

Python

Prototype lang

Rust

Target core lang

The Problem

Why Build a New Archive Format?

Zip and tar were designed in a different era. They treat an archive as a flat list of files — no deduplication, no search, no relationships, no adaptive compression. Every repack is a full copy.

For structured creative and technical projects — codebases, design assets, research data — the archive format itself should understand the content.

Content-Addressed Deduplication

Identical chunks stored once. Incremental archives without redundancy.

Adaptive Compression

Per-chunk, per-content-type algorithm selection. Code ≠ images ≠ audio.

Full-Text & Metadata Search

Find a file inside an archive without extracting. SQLite metadata index.

Versioning & Delta Storage

Track changes over time. Only changed chunks stored in each version.

Encryption

First-class encrypted archives, not an afterthought.

Milestones

Proving the Architecture First

Each milestone must be benchmarked and proven before the next begins. No premature optimization.

Current — Prototype Alpha

Foundation

SQLite metadata schema, archive inspector, verification, and benchmark framework.

SQLite schema alpha inspect alpha verify Benchmark framework Unit test skeleton

Basic Archive I/O

alpha create and alpha extract. Single-pass flat archive, no deduplication yet. Baseline benchmarks established.

Content-Addressed Deduplication

Chunk boundaries, rolling hash, content-addressed chunk store. Measure space savings on real project trees.

Adaptive Compression

Per-chunk algorithm selection. Benchmark zstd, brotli, and lz4 across content types. Choose per file signature.

M4+

Search · Versioning · Encryption · Rust Core

Full-text metadata search, delta versioning, first-class encryption, and migration of the core to Rust for performance.

Long-Term Vision

What It Could Become

If the research proves out, these are the product directions that follow.

Developer Archiver

A replacement for zip/tar in developer workflows — fast, searchable, deduplicated project archives.

Creative Asset Storage

Archive format for design studios and video teams — where assets are large, versioned, and referenced repeatedly.

Encrypted Backup Format

Encrypted, deduplicated, searchable offline backups — the data layer beneath future LPagesAppLabs products.

Research & Data Archives

Structured archives for research datasets — with provenance, versioning, and relationship graphs.

Get in Touch

Follow the Research

Interested in this problem space? Get in touch.

contact@lpagesapplabs.com

A SmarterArchive Format

Why Build a New Archive Format?

Proving the Architecture First

What It Could Become

Follow the Research

A Smarter
Archive Format