Data Forge

Offline deterministic merge and cleaning for messy spreadsheet workflows.

A local, auditable utility for teams that need repeatable spreadsheet merge, cleanup, reconciliation, and review without uploading sensitive files to the cloud.

Platform overview

Repeatable data workflows, local-first operation, and one deterministic architecture across Data Forge, Cybersecurity Fortress, GraniteForge, and modular SDKs.

The week before the model starts

Most analytical work does not begin with modeling. It begins with messy files, vendor mismatches, broken headers, duplicate rows, missing cells, and fragile notebook glue. Data Forge is built for that recurring preparation layer: the workflow teams repeat every month but rarely trust completely.

What it does

Ranked merge

Primary files win. Backup sources fill gaps. Disagreements are logged instead of silently overwritten.

Shape-aware cleaning

Supports panel data, time series, and wide record tables so country-year, entity-time, job, company, SKU, and vendor tables are not forced into the wrong structure.

Local execution

Runs on the user's machine. No cloud upload required.

Audit artifacts

Every run produces a cleaned CSV, cleaning report, and manifest.

Deterministic repeatability

Identical inputs and settings produce identical output hashes.

Private evaluation

Controlled paid evaluations for qualified teams. No source access.

Built for

Financial data operations

Recurring vendor and internal file stacks.

Vendor reconciliation

Ranked sources with a conflict log.

Healthcare analytics exports

Panel and record shapes without cloud upload.

Energy and regulatory reporting

Multi-file merges with audit trail.

Country-by-year panels

Entity-time keys preserved.

Job, company, SKU, record tables

Wide tables keep text columns.

Monthly spreadsheet workflows

Same recipe, same hash next month.

Air-gapped review

Desktop workflow for sensitive environments.

One architecture — not one product

Data Forge is in private evaluation today. It shares the same deterministic platform as Cybersecurity Fortress, GraniteForge, simulation, and modular SDKs — licensed surfaces you can adopt one at a time or together.

License pricing → · Full platform catalog → · Overview video →

The deterministic workflow: from raw data to reliable decisions
Deterministic workflow — upload, define shape and rank, merge, audit. Watch overview →
Balantium platform mind map: deterministic architecture branching to Data Forge, Cybersecurity Fortress, GraniteForge, modular SDKs, and cross-cutting features
Deterministic architecture — product surfaces and cross-cutting features (local-first, user-defined events, cryptographic manifests).

Overview deck — Balantium Deterministic Platform

Full platform narrative (14 slides). Architecture, workflow, security posture, evidence scope, and FAQ.

Click any slide to fullscreen · Arrow keys to navigate

Overview deck

Private 7-Day Evaluation

Run Data Forge on your machine with your files (or anonymized stand-ins). You keep every artifact. One week to prove whether a license is worth it — not a free download, not a sales demo you watch us perform.

Enterprise license

Production deploy after evaluation or direct qualification. Written quote on order form — tied to multi-year savings on your side.

Pricing methodology →

Evaluation includes

  • ✓ Sealed evaluation build
  • ✓ Desktop and/or CLI workflow
  • ✓ Demo recipe
  • ✓ Output CSV
  • ✓ Cleaning report
  • ✓ Manifest
  • ✓ Technical review call

Rules

  • Paid evaluations only
  • No free pilots
  • No source access
  • No public repository access
  • No architecture transfer
  • No white-label use without license