Camera master (est.)
~1.16 GBUncompressed 1080p, 402-frame source before H.264
Investors and partners can explore how the ISL Nexus pipeline performs on a live talking-head sequence. Our instruction-first file system assembles the most efficient file structure possible before regenerating pixels, so you can compare the neural stack against the classic instruction encoder, inspect the reconstructed frames, and verify how aggressively the payload shrinks while keeping the decoded output aligned with the source.
These runs are generated from March 2025 prototype builds. Expect rapid iteration — charts and payloads will evolve as the models mature. The camera master is an estimated 1080p 8-bit 4:2:0 sequence (402 frames) at ~1.16 GB; the 8.83 MB H.264 clip used here is already compressed before ISL processing.
Each scenario card includes the original capture, the generated .visl instruction stream, and the decoded video. Toggle linked playback to keep streams synchronized or scrub both clips with the shared timeline control.
Our transformer-based encoder compresses the talking-head sample into a compact instruction graph. Even at a proof-of-concept stage, the reconstruction preserves skin gradients, eye specular highlights, and lip sync. The camera master would weigh roughly 1.16 GB before H.264; this demo begins with that already-compressed 8.83 MB clip.
Uncompressed 1080p, 402-frame source before H.264
QuickTime MOV · Baseline reference stream
Neural instructions · 8.0× smaller than source
MP4 regenerated from instructions · Ready for CDN delivery
Raw size estimate assumes 1920×1080, 8-bit 4:2:0 sampling across 402 frames (≈1.16 GB) before any codec or instruction compression.
Typical use case: transmit the .visl payload (1.10 MB), reconstruct on the client, and stream the decoded MP4 only when archiving or interfacing with legacy systems.
The classical instruction encoder relies on hand-crafted heuristics and deterministic transforms. Payloads remain smaller than H.264, but the bitrate is materially higher than the neural approach and incurs subtle ringing around high-frequency edges. As with the neural run, the camera master is roughly 1.16 GB prior to H.264; this comparison therefore shows the gap between classical ISL, neural ISL, and the already-compressed baseline.
Uncompressed 1080p, 402-frame source before H.264
QuickTime MOV reused for parity testing
Classic instructions · 1.4× smaller than source
Legacy pipeline MP4 · Downstream ready with minor artifacts
Estimate uses the same 1080p, 402-frame camera master assumption (~1.16 GB) to highlight how each pipeline compares with the pre-compression source.
Compression gains are present but less dramatic than the neural build. Artifact suppression is also weaker, especially on hair detail and the subject's collar.
Even before production tuning, the neural pipeline cuts payload size by ~87% and preserves viewer-perceived quality. The classic build remains competitive with traditional codecs but demonstrates the upside unlocked by neural modeling.
| Metric | Neural stack | ISL Classic | What it means for partners |
|---|---|---|---|
| .visl payload (MB) | 1.10 | 6.42 | Neural stream slashes storage & egress burden; classic still ahead of H.264 but less transformative. |
| Vs. camera master | ~1.16 GB → 1.10 MB (≈1,100× smaller) | ~1.16 GB → 6.42 MB (≈195× smaller) | Neural ISL nearly eliminates raw capture weight; classic still delivers large savings vs. uncompressed but lags neural efficiency. |
| Compression ratio | 8.0× vs source | 1.4× vs source | Neural instruction graphs unlock double-digit ROI for hyperscalers; classic offers incremental savings. |
| Visual fidelity | Skin tones + textural detail retained | Mild ringing & haloing on edges | Neural pipeline meets premium OTT quality bars; classic targets archival & surveillance workloads. |
| Prototype maturity | March 2025 build — rapid retraining underway | Legacy engine — codebase stable | Proof points will sharpen quickly; investors can influence roadmap prioritization now. |
Upcoming releases will target 4K test reels, multi-speaker scenes, and on-device decoding benchmarks. We are also integrating perceptual telemetry to produce SSIM / VMAF overlays in real time inside this demo.