The problemWhy ternaryLive siliconThe race Data centresProofHistoryInvest
Measured on silicon-grade tooling · independently verified

AI compute, reinvented
in three states.

measured energy advantage per AI token vs FP16 compute · not a projection

T3ISC builds ternary silicon — chips that think in three states instead of two — purpose-built for the efficient-AI models the industry is already racing toward. Same accuracy. A twelfth of the energy. Printed on the standard binary CMOS your foundry runs today.

4–12×
AI perf/W vs leading server CPU
8×
less data moved per answer
~10×
fewer racks, same throughput
<1.4%
accuracy delta, measured end-to-end
Scroll
● LIVE  If global AI inference ran on ternary silicon — saved since you opened this page:
that's ≈180 TWh, ≈£18 billion and ≈73 Mt of CO₂ a year — a pool growing ~30%/yr (≈£35–40bn by 2030)
0.0
MWh of electricity
£0
in running cost
0.0
tonnes of CO₂
0
UK homes' annual electricity
The headlines

Three numbers define the business.

Everyone is staring at AI — so are we. But the same silicon carries a measured win for the computing everyone else has forgotten, and a silicon-economics story that makes the margin.

12×
AI — energy per token

The anchor. The fastest-growing electricity bill in computing, cut by ~91% — same model, same accuracy, a twelfth of the energy.

measured · 11.6×, shown rounded
1.3×
General computing — the whole system

While the world optimises AI, the rest of the estate still burns power. Across the integrated system, ordinary workloads get a measured 1.3× efficiency win — at CPU performance parity with production server-class cores. Real, bounded, and honest.

measured · silicon grade
2–3×
Less silicon — where the margin wins

The same AI throughput from 2–3× fewer square millimetres of die. Smaller die, more chips per wafer, structurally lower cost per token — 10×+ at the hardware level. Efficiency sells the product; silicon economics make the margin.

engineering model
The problem

AI isn't compute-limited.
It's movement-limited.

Inside every AI chip, the arithmetic is nearly free. The energy bill comes from moving data — fetching billions of weights from memory, over and over, for every single token. Data centres are now power-capped, not silicon-capped. The winner of the AI era is whoever moves the least data.

Energy to compute on a value1 unit
Energy to fetch it from off-chip memoryup to ~1,000 units
~1,000×
Moving a value can cost on the order of a thousand times more energy than computing with it. That ratio — the memory wall — dominates the economics of AI.

Data centres are energy-capped

AI inference is the largest and fastest-growing line item in computing — and the grid, not the silicon, is now the limit on growth.

🔁

Every token re-reads the model

Serving a large model means streaming its weights through the chip for every answer. Shrink the weights, and you shrink the entire energy bill.

📉

The industry is already descending the precision ladder

FP16 → FP8 → FP4 — every leading chipmaker is racing to fewer bits per weight. Ternary is the floor of that ladder. We build the silicon that lives there.

Energy-per-operation versus energy-per-off-chip-fetch ratios are well established in published silicon literature; the figure shown is an order-of-magnitude illustration, not a T3ISC-specific claim.

The answer

Three states. Ten times less to move.

Modern efficient-AI models (the BitNet class) need each weight to say only three things: negative, zero, or positive. Storing that in 16-bit floating point wastes ~90% of every byte. A ternary digit — a trit — stores it natively. Same answers, a fraction of the data.

One AI weight, two ways

The value "−1", as silicon actually stores it:

1011110000000000
FP16 — 16 bits
2 bytes
Ternary — 1 trit
≈0.2 bytes
14 GB
FP16 weights
1.4 GB
Ternary weights
10×
Less data per answer

Information theory: one trit carries log₂3 ≈ 1.585 bits. Ternary-weight models are published, peer-reviewed art (BitNet b1.58); T3ISC builds the silicon that runs them natively instead of emulating them on binary maths units.

The precision ladder — and its floor

Bits stored per weight. The whole industry is climbing down. We're waiting at the bottom.

FP16
16 bits
FP8
8 bits
FP4
4 bits
TERNARY
1.58 bits

Below ternary, model accuracy collapses — published research shows three levels is the matched representation for AI weights. This is the end-state of the precision race, and it's defensible: you can't go lower.

Live demo · 01 — real time, in 3D

Watch a ternary chip think.

Weight trits stream into the compute core — +1 passes straight through, −1 flips for free, 0 is skipped entirely — and the answer accumulates. Drag to orbit. Tap a floating trit. Pulse a batch.

Live compute
Live demo · 02 — throughput & energy

Same model. Same question.
Watch the difference.

Two identical AI workloads, side by side — conventional binary versus T3ISC ternary. Both produce the same answer. Watch the energy meters.

Live race

Paced from the measured whole-system energy-per-token advantage (≈12× vs FP16 — 11.6× measured, shown rounded — switching-activity measurement on industry-standard silicon tooling) and the measured 4–12× performance-per-watt composite range. Ratios, not mechanism — how the silicon does this is the patented part.

Live demo · 03 — the wire

Same answer. 8× less data on the wire.

The memory wall is paid in bytes moved. Binary streams 16 bits per weight; ternary streams a trit. Watch the pipes — same result out, 8× less data through.

Live data flow
Live demo · 04 — the one that matters

Your AI estate, at a tenth of the size.

Pick a serving workload. Both halls below have the same floor area and the same throughput — the difference is how much of it ternary silicon actually needs to power. The dark cells are racks you never build.

120 racks

Conventional GPU hall120 racks

every rack powered · 14.4 MW IT load
⚠ entire hall lit, cooled and billed

T3ISC ternary hall — same throughput12 racks

only the bright racks are powered · 1.2 MW IT load
108 racks you never build — floor space freed for growth
−91%
Annual energy
0 GWh
Electricity saved / year
£0M
Running cost saved / year
0 kt
CO₂ avoided / year
−90%
Racks & floor space

Illustrative model: equal token throughput; consolidation from the measured ≈12× energy-per-token ratio (11.6× measured, applied conservatively as 10× rack consolidation); 120 kW per conventional accelerator rack; PUE 1.2; £0.10/kWh electricity only; grid intensity 0.35 kgCO₂/kWh. Total cost-of-ownership savings are larger still — cooling plant, floor space and hardware all shrink with the racks. The ratio is the product; the absolute figures are arithmetic — substitute your own assumptions and the percentages hold.

One platform, many workloads

Not just chat.
A platform for efficient compute.

AI inference is the anchor — but the same silicon reaches the structured-data workloads that dominate the modern data centre. Where the data has structure, the advantage applies. Where it doesn't, we coexist with conventional hosts — and say so.

💬

AI inference

The flagship: ≈12× measured energy per token vs FP16, accuracy preserved. The fastest-growing electricity bill in computing, cut by ~91%.

measured
🎬

Video & media

Core video-processing blocks validated bit-exact at silicon grade. Streaming, transcoding and media pipelines ride the same efficiency substrate as AI.

validated bit-exact
📊

Analytics & databases

Columnar scans and streaming analytics map naturally onto the architecture — component-level measurements on these access patterns reach an order of magnitude lower power against production reference designs.

measured · component level
🕹️

Graphics & rendering

Neural rendering measured at 2.5× performance-per-watt against published figures for a flagship gaming GPU — the advantage extends into graphics, gaming and visual computing.

measured proxy
📱

Embedded & edge

The same architecture scales down to battery budgets — on-device AI, wearables and ambient compute, where every milliwatt is product-defining.

architecture validated
🤝

General-purpose code — the honest tile

Plain server logic doesn't carry the same structure, so we don't claim 12× there — we measure a 1.3× whole-system win, at CPU performance parity. T3ISC coexists with conventional hosts and wins where the advantage is decisive — the Apple-Silicon playbook. Investors hear claims; we publish boundaries.

honest boundary

Workload coverage figures are per-class measurements at the stated grades, not a uniform system-wide multiple. The data-centre savings modelled on this page are anchored to AI inference — the largest and fastest-growing slice — with the structured-data classes above extending the addressable share.

Proof, not promises

Measured. Verified. Adversarially audited.

Deep-tech hardware fails on optimistic numbers. Ours are produced the opposite way: every figure is measured on industry-standard silicon tooling, then attacked by an independent adversarial verification pass before it is allowed into the record. Claims that didn't survive were retired — publicly, in our own audit trail.

11.6×
Energy per AI token vs FP16

Whole-system measurement — full design, real switching activity, end to end on industry-standard silicon tooling. A 91% reduction in the energy each answer costs. The "12×" on this page is this figure, rounded.

measured · silicon-grade
4–12×
AI perf/W vs a leading server-class CPU

The honest composite range — architecture × ternary numerics × system integration, folded conservatively with every overlap removed. We publish the range, not the cherry-pick.

measured composite
1.3×
Whole-system efficiency on general computing

Measured across the integrated system at silicon grade on ordinary, non-AI workloads — with CPU performance at parity with production server-class cores. The efficiency is additive, not traded for speed.

measured · silicon-grade
<1.4%
Accuracy delta, end to end

A 2-billion-parameter ternary model run through the full architecture produces answers within 1.4% perplexity of full precision — bit-exact functional correctness across 1,024 of 1,024 verification vectors.

measured · end-to-end
2.5×
Perf/W vs a flagship gaming GPU

Neural rendering measured on real hardware, compared against published figures for a current flagship GPU — evidence the advantage extends beyond chat into graphics, media and the wider efficient-AI stack.

measured proxy · real hardware
~2–3×
Smaller silicon than a leading data-centre GPU

Modelled die area for the server-class part. Fewer square millimetres per token means more chips per wafer — the margin advantage is built in at manufacture, before the energy savings even start.

engineering model
100%
Software toolchain, proven

New chips die when nothing runs on them. Real programs compile and run on our architecture today, verified bit-for-bit against a standard reference — the "Itanium risk", retired before tape-out.

verified bit-exact
🛡️

Zero unverified claims reached this page. Our verification methodology is engineered to catch its own optimistic errors: independent reviewers re-derive every headline number from the raw artefacts, and any figure that fails is retired and recorded. That audit trail — unusual in this industry — is itself a due-diligence asset. The figures investors test are the figures that survive.

No exotic physics

Printed on the fabs the world already built.

This is not a bet on a new material, a new transistor, or a new fab. T3ISC's ternary architecture is implemented entirely in standard binary CMOS — the same process, the same transistors, the same foundries manufacturing today's chips.

🏭

Standard CMOS process

Manufacturable on production nodes at existing foundries. No new equipment, no new materials, no process risk added to the venture.

📐

Validated with industry tooling

The architecture has been carried through synthesis, place-and-route and energy measurement on the same class of tooling production silicon teams use.

🧠

Runs today's efficient models

Ternary-weight models are published, open art with frontier-lab momentum behind them. The models exist; what's missing is silicon that runs them natively. That's us.

🔌

Drops into binary infrastructure

The architecture speaks standard interfaces and coexists with conventional hosts — adopting it doesn't mean replacing the data centre, just shrinking it.

Ternary efficiency on binary CMOS — same transistor budget, three states.
HOW? THAT'S THE PATENTED PART →
186 years in the making

Ternary isn't new.
Practical ternary is.

Engineers have known for nearly two centuries that three states beat two on efficiency. What was always missing is a way to build it without exotic devices. That's the part we solved — and patented.

1840

Fowler's ternary calculating machine

Thomas Fowler, a self-taught English inventor, builds a working calculating machine in wood — using balanced ternary because it needed far fewer parts than decimal or binary designs of the day.

1958

Setun — the first ternary computer

Nikolay Brusentsov's Setun at Moscow State University runs on balanced ternary — and proves cheaper and more reliable than binary contemporaries. Roughly fifty are built before the project is shut down for standardisation, not for engineering reasons.

1960s

"Perhaps the prettiest number system of all"

Donald Knuth, in The Art of Computer Programming, singles out balanced ternary for its elegance — flagging that hardware might one day return to it. For decades, binary fab economics kept that day from coming.

2024

BitNet b1.58 — AI meets ternary

Microsoft Research shows that large language models run on ternary weights {−1, 0, +1} with accuracy matching full precision — at 1.58 bits per weight. The workload the world most wants to run cheaply turns out to be natively ternary.

2026

T3ISC — ternary silicon you can print today

T3ISC closes the 186-year gap: a ternary architecture implemented entirely on standard binary CMOS — no exotic devices, no new fabs. Patents filed. The precision ladder ends here, and we own the floor.

The opportunity

A focused platform play,
on a proven precedent.

£150–250B

served market by 2032 — AI inference across cloud and edge data centres through embedded, plus the cross-domain workloads (media, analytics, secure compute) where the same advantage applies.

🍎

The Apple-Silicon playbook

Dominate the workloads where a purpose-built advantage is decisive, rather than competing everywhere. Specialist platforms win categories.

⏱️

Timing: the wedge is open now

The industry is moving to radically more efficient models — but is still serving them on general-purpose silicon designed for a previous era. Purpose-built silicon captures an order-of-magnitude gap incumbents cannot close by tuning.

🧱

A three-layer moat

Patent-protected architecture (priority window open) · a working end-to-end software stack competitors consistently underestimate · and a verification discipline that survives due diligence.

Status

Past the hardest unknowns. Into execution.

Complete

Architecture designed & validated

Full system, silicon-grade evidence on every load-bearing component.

Complete

Efficiency independently verified

Adversarial verification across the measured claim set. Zero unverified figures.

Complete

Software toolchain proven

Real programs compile and run, bit-exact against reference. Itanium risk retired.

In progress

Patent portfolio

First applications filed; further filings inside the international priority window.

Invest

Enter at the inflection point.

The deepest technical risks — architecture, efficiency, software — are already retired and independently verified. Capital now converts directly into the tape-out path, on the Apple-Silicon precedent, into a market being rebuilt around energy.

Measured ≈12× energy advantage Verification trail built for diligence Standard CMOS — no fab risk Patents in priority window