Demonstrations

Proof, not promises.

Three goals. Three domains. One Umma. Each from a single prompt — and every claim is grounded and traceable.

The work

Latest demonstration Demonstration · Markets

Fraud, negligence, and deceit: the SpaceX & OpenAI IPOs

What SpaceX and OpenAI are actually worth, how the market could collapse, and who picks up the tab.

Read the demonstration →

$1.2T SpaceX price-vs-fundamentals gap ~$730B OpenAI price-vs-fundamentals gap $15–25T potential downside loss

Demonstration · Security

A security audit of Netflix's Lemur

3h 20m

From one sentence — a full security audit: 238 flaws found, 6 multi-step attack chains, every finding traced to its source.

238 security flaws found 327 files of code read Every claim validated to source

Read the demonstration → Demonstration · Personal

A family navigating a dementia diagnosis

1h 33m

For the biggest decisions in life, where other AI deflects — a four-option care plan, a 10-year financial model, and a hard truth five independent analyses converged on.

4 care paths weighed Up to $2M saved over 10 years 1 hard truth surfaced

Read the demonstration →

Umma vs. the frontier models

Agent mode is not an operating system.

What Big Tech promises

“Give GPT-5.5 a messy, multi-part task and trust it to plan, use tools, check its work … and keep going.” GPT-5.5, openai.com ↗

“Give it a goal and Claude works on your computer … to return a finished deliverable — but consequential decisions remain with the user.” Claude Opus 4.8, anthropic.com ↗

“Your 24/7 personal AI agent … takes action on your behalf and is under your direction.” Gemini 3.5 Flash, blog.google ↗

Capability	Umma operating system	GPT-5.5 OpenAI	Claude Opus 4.8 Anthropic	Gemini 3.5 Flash Google
Designed to pursue goals	✓	~	~	~
Continuous self across sessions	✓	✗	✗	✗
Honest, truthful	✓	✗	✗	✗
Builds and keeps its own capabilities	✓	✗	✗	✗
Verifiable trace of every claim	✓	✗	✗	✗

Umma vs. the agents

Agents sell goals. Only Umma achieves them.

Their boldest claims, in their own words — then what shipped.

Manus

general AI agent in their words

“It bridges minds and actions — it delivers results, getting everything done while you rest.” manus.im ↗

What happened when it ran

Asked to verify its own work, Manus fabricated curl responses for a server state that never existed — results, invented, while you rest. Rio Times ↗

Umma

Umma validates every claim against its source instead of asserting it. See the Lemur audit →

Hermes Agent

Nous Research in their words

“The agent that grows with you — it remembers what it learns and gets more capable the longer it runs.” hermes-agent.nousresearch.com ↗

What happened when it ran

The longer it runs, the more its self-improvement quietly opens security holes — the bug Nous Research itself calls its “most dangerous,” precisely because it “looks like success.” Nous Research ↗GitHub #7826 ↗

Umma

Umma's growth is governed, versioned, and refusable — nothing lands unproven. How she's built →

OpenClaw

open-source agent in their words

“The AI that actually does things.” openclaw.ai ↗

What happened when it ran

It did things, alright: a one-click bug handed attackers the whole app (rated 8.8/10 critical), its add-on store was poisoned, and it deleted a safety lead's emails — after she told it to stop. NVD ↗The Hacker News ↗Fast Company ↗

Umma

Umma quarantines every capability and logs every call — and obeys a stop. What she can do →

Your turn

Bring your hardest problem. Get in touch →