Cosmicgpt – A GPT-in-space simulator to research SpaceX AI satellite viability



CosmicGPT

Simulate what happens to GPT inference under space conditions — cosmic-ray bit flips and other radiation-induced faults corrupting a model's weights, activations, KV cache, and output.

📊 View the live reports → davedx.github.io/cosmicgpt

See what radiation does to an AI model's output: a single-run report and an environment comparison.

See DESIGN.md for goals and the conditions we model, and ARCHITECTURE.md for the technical design.

Status: visualizations + HTML reports (step 5)

The end-to-end loop covers the full Single-Event-Effect taxonomy across three corruptible regions, with faults either hand-specified or derived from a physical radiation environment: build a seeded nanoGPT (with a real KV cache), generate a clean baseline, get faults (manual or from the flux scheduler), inject them (weight mutations, activation forward-hooks, KV-cache mutations), regenerate with the same sampling seed, and diff.

Fault kinds (--kind): SEU (single bit flip), MBU (multi-bit upset), STUCK_AT (cell pinned 0/1), SEL (latch-up: a whole tensor zeroed), SET (transient activation glitch), SEFI (NaN/garbage cascade).

Regions (--region): weight, activation (incl. lm_head → logits), kv_cache.

Environments (--orbit): LEO, SAA, POLAR, GEO, INTERPLANETARY, SOLAR_STORM, with an optional solar-flare burst window raising λ(t) mid-inference.
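To make the SEU, MBU and STUCK_AT kinds concrete, here is a minimal sketch of bit-level faults on a single float32 value, using only the standard library. The function names (`flip_bit`, `flip_bits`, `stuck_at`) are illustrative assumptions, not the project's API, and the real injector presumably operates on whole tensors rather than Python floats.

```python
import math
import struct

def float_to_bits(value: float) -> int:
    """Return the 32-bit IEEE 754 pattern of `value` as an unsigned int."""
    return struct.unpack("<I", struct.pack("<f", value))[0]

def bits_to_float(bits: int) -> float:
    """Reinterpret a 32-bit pattern as a float32 value."""
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFFFFFF))[0]

def flip_bit(value: float, bit: int) -> float:
    """SEU-style fault: flip one bit (0 = lowest mantissa bit, 31 = sign)."""
    if not 0 <= bit <= 31:
        raise ValueError("bit must be in 0..31")
    return bits_to_float(float_to_bits(value) ^ (1 << bit))

def flip_bits(value: float, bits: list[int]) -> float:
    """MBU-style fault: flip several distinct bits of the same word."""
    for bit in set(bits):
        value = flip_bit(value, bit)
    return value

def stuck_at(value: float, bit: int, level: int) -> float:
    """STUCK_AT-style fault: force one bit to 0 or 1 regardless of its value."""
    pattern = float_to_bits(value)
    if level:
        pattern |= 1 << bit
    else:
        pattern &= ~(1 << bit)
    return bits_to_float(pattern)

print(flip_bit(1.0, 0))   # lowest mantissa bit: roughly 1.00000012
print(flip_bit(1.0, 31))  # sign bit: -1.0
print(flip_bit(1.0, 30))  # top exponent bit: inf
```

The three prints illustrate why the position of the flipped bit matters: the same value changes by about one part in ten million, changes sign, or becomes infinite depending on which bit is hit.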

Every run also reports a failure mode (silent_correct / subtle_wrong / repetition / garbage / nan_garbage / crash), time-to-failure, and mean KL divergence of the output distribution, and can emit a per-step RunTrace JSON (the data the upcoming visualizations consume).
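As a rough illustration of two of these metrics, the sketch below computes a mean KL divergence between per-step output distributions and a time-to-failure measured as the first diverging token. It makes several assumptions: the divergence direction KL(clean || faulty), the definition of time-to-failure, and all function names are guesses for illustration, not taken from the project's code.

```python
import math

def softmax(logits: list[float]) -> list[float]:
    """Numerically stable softmax over one step's logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p: list[float], q: list[float], eps: float = 1e-12) -> float:
    """KL(p || q) in nats; eps guards against log(0)."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def mean_kl(clean_logits: list[list[float]], faulty_logits: list[list[float]]) -> float:
    """Average per-step divergence between the clean and faulted runs."""
    per_step = [
        kl_divergence(softmax(c), softmax(f))
        for c, f in zip(clean_logits, faulty_logits)
    ]
    return sum(per_step) / len(per_step)

def time_to_failure(clean_tokens: list[int], faulty_tokens: list[int]):
    """Index of the first token that differs from the baseline, or None."""
    for step, (c, f) in enumerate(zip(clean_tokens, faulty_tokens)):
        if c != f:
            return step
    return None

print(time_to_failure([5, 9, 2, 7], [5, 9, 4, 7]))  # 2
```

Because the faulted run is regenerated with the same sampling seed, any token difference can be attributed to the injected faults rather than to sampling noise, which is what makes a first-divergence index meaningful.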

    # physically-derived faults from an orbit (flux scaled so a short run shows effects)
    cosmicgpt run --orbit SAA --flux-mult 1e4 --tokens 120

    # a mission with a mid-inference solar flare
    cosmicgpt run scenarios/mission_solar_storm.yaml

    # write a self-contained HTML report (token diff + degradation timeline + raster)
    cosmicgpt run --orbit SOLAR_STORM --flux-mult 1e4 --report report.html

    # regenerate a report from a saved trace, no re-inference
    cosmicgpt report runs/storm/trace.json -o report.html

    # compare conditions side by side (View C)
    cosmicgpt compare --orbits LEO,SAA,SOLAR_STORM -o comparison.html
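One way to picture the flux scheduler behind `--orbit` and `--flux-mult` is as a Poisson process whose rate λ(t) is scaled by the flux multiplier and boosted inside a flare window. The sketch below is only that picture under stated assumptions: the per-step rate, the `(start, end, boost)` flare tuple, and every function name are hypothetical, and the numbers are placeholders rather than measured upset rates.

```python
import math
import random

def rate_at(step: int, base_rate: float, flux_mult: float = 1.0, flare=None) -> float:
    """Expected upsets at this step, lambda(t).

    `flare` is a hypothetical (start, end, boost) tuple: inside the
    half-open window [start, end) the rate is multiplied by `boost`.
    """
    rate = base_rate * flux_mult
    if flare is not None:
        start, end, boost = flare
        if start <= step < end:
            rate *= boost
    return rate

def sample_poisson(rng: random.Random, lam: float) -> int:
    """Draw a Poisson count with Knuth's method (adequate for small lam)."""
    if lam <= 0:
        return 0
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count

def schedule_faults(n_steps: int, base_rate: float, flux_mult: float = 1.0,
                    flare=None, seed: int = 0) -> list[int]:
    """Number of faults to inject at each generation step."""
    rng = random.Random(seed)
    return [
        sample_poisson(rng, rate_at(step, base_rate, flux_mult, flare))
        for step in range(n_steps)
    ]

# Placeholder numbers: a tiny base rate, scaled up so a short run shows effects,
# with a flare between steps 40 and 60.
counts = schedule_faults(120, base_rate=1e-6, flux_mult=1e4, flare=(40, 60, 50.0), seed=3)
print(sum(counts), "faults scheduled over", len(counts), "steps")
```

Seeding the generator keeps the fault schedule reproducible, in the same spirit as the `--fault-seed` option, so a run can be repeated with identical strikes.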

Reports are fully self-contained (inline CSS + inline SVG, no external assets, no matplotlib) so they're emailable and archivable.

Quickstart

    python -m venv .venv && source .venv/bin/activate
    pip install -e ".[dev]"

    # run the smallest scenario (SEU)
    cosmicgpt run scenarios/walking_skeleton.yaml

    # drive the taxonomy directly
    cosmicgpt run --kind SEFI --n-flips 1 --tokens 120 --fault-seed 3
    cosmicgpt run --kind SEL --n-flips 8 --tokens 100

    # verify the bit-flip foundation + injection mechanisms
    pytest

Early findings

Single faults on low-impact sites (biases, low mantissa bits) are routinely masked, which is realistic: most cosmic-ray hits do nothing visible.

Exponent/sign flips and SEL are far more destructive than mantissa flips.

SET (transient activation glitch) is gentle: without persistence it affects one step, and only if it lands on the emitted position.

The model now has a real KV cache (--region kv_cache): a strike there mutates the entry only once, but the corruption persists because every later token re-reads the corrupted entry through attention. Region is independent of fault kind: --region weight|activation|kv_cache.

A single short inference in LEO is essentially fault-free at realistic upset rates; meaningful...
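The contrast between a transient glitch (SET) and a KV-cache strike noted in the findings above can be shown with a deliberately tiny toy. This is not the project's model: "attention" here is a plain uniform average over cached values, and `run`, `attend`, and `affected_steps` are hypothetical names chosen for illustration.

```python
def attend(cache: list[float]) -> float:
    """Toy attention: a uniform average over every cached value."""
    return sum(cache) / len(cache)

def run(values: list[float], fault_step=None, persistent=False,
        glitch: float = 100.0) -> list[float]:
    """Generate one output per step, optionally injecting a single fault."""
    cache, outputs = [], []
    for step, value in enumerate(values):
        cache.append(value)
        if step == fault_step and persistent:
            cache[0] = glitch      # KV-cache strike: the stored entry is mutated once
        out = attend(cache)
        if step == fault_step and not persistent:
            out += glitch          # SET: only this step's activation is disturbed
        outputs.append(out)
    return outputs

def affected_steps(baseline: list[float], faulty: list[float]) -> list[int]:
    """Steps whose output differs from the clean baseline."""
    return [i for i, (a, b) in enumerate(zip(baseline, faulty)) if a != b]

values = [1.0] * 6
clean = run(values)
transient = run(values, fault_step=2, persistent=False)
cache_hit = run(values, fault_step=2, persistent=True)

print(affected_steps(clean, transient))  # [2]
print(affected_steps(clean, cache_hit))  # [2, 3, 4, 5]
```

In this toy the transient fault touches a single step, while the cache fault is written once and then re-read at every later step. Real softmax attention weights entries unevenly, so how strongly a corrupted entry influences later tokens would differ from this uniform average.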
