Detection Engine — Executable Roadmap
GeaSpirit fuses computational materials science with satellite remote sensing to detect mineral prospectivity from public-access data. This page is the engine's working logbook — schematic and honest: what is validated, what failed, where the real ceiling is, and the road to a sellable exploration report and, ultimately, a mining-asset marketplace.
Operating philosophy. Make → Measure → Fix → Repeat. Every sprint must produce something demonstrable. Stack: 100% public-access data and open tools (Sentinel-2/1, Landsat, SRTM, EMIT, USGS / national geological surveys, Macrostrat, Google Earth Engine). No invented figures, ever.
1. The pipeline
For any area of interest the engine runs a fixed sequence and outputs a ranked target list with confidence:
- Acquire — pull public-access satellite, terrain, geophysics and geology layers over the AOI.
- Feature build — spectral mineral indices, thermal climatology, terrain/structure, geology, vegetation.
- Type-aware fusion — select the optimal feature subset for that deposit type and zone.
- Score & calibrate — probability map with honest, calibrated confidence (spatial cross-validation).
- Rank & report — ranked targets with exact coordinates, evidence breakdown and limitations.
2. Sprint plan (0 → sellable report)
| Sprint | Goal | Key metric |
|---|---|---|
| 0 | Setup & public data acquisition | Load a Sentinel-2 scene |
| 1 | First mineral-index maps | Visual correlation with known deposits |
| 2 | First ML classifier | AUC > 0.65 |
| 3 | Multi-sensor fusion | AUC > 0.72 |
| 4 | Vegetation as a sensor | +2% AUC from vegetation |
| 5 | Transfer learning | AUC > 0.60 on a new zone |
| 6 | Deposit-model integration | 3/5 known deposits found |
| 7 | First sellable report | Positive geologist feedback |
3. Validated results — multi-zone, multi-deposit, multi-continent
Conservative, leakage-resistant spatial cross-validation AUC. Higher is better (0.5 = random).
| Zone | Deposit type | Baseline | Full fusion |
|---|---|---|---|
| Chuquicamata (Chile) | Porphyry Cu | 0.79 | 0.88 |
| Kalgoorlie (Australia) | Orogenic Au | 0.81 | 0.94 |
| Zambia Copperbelt | Sediment-hosted Cu | 0.74 | 0.76 |
| Peru | Porphyry Cu | 0.70 | 0.76 |
| Arizona (USA) | Porphyry Cu | 0.72 | 0.72 |
Fusion is not universal: it lifts strong-baseline zones and can hurt weak ones. The engine therefore selects features per zone and per deposit type — there is no single planet-wide formula.
4. Feature families — what is actually validated
| Family (public-access source) | Verdict |
|---|---|
| Satellite spectral baseline (Sentinel-2 / Landsat) | Core — always included |
| 20-year thermal climatology (Landsat) | Validated, universal & modest |
| EMIT hyperspectral alteration (NASA) | Validated — porphyry Cu only |
| Geology / lithology (Macrostrat) | Validated selective (3-zone evidence) |
| Aeromagnetics (national surveys) | Selective — small but real lift |
| Multi-decadal NDVI / vegetation (Landsat) | Selective — vegetated zones |
| Terrain / spatial gradients | Redundant with spectral — not added |
5. Key learnings
- Labels dominate. Same satellite data, Kalgoorlie went from 0.58 → 0.77 just by going from 16 to 205 curated deposit labels.
- Deposit type > commodity > geography. Training on pure types sharpens the model; mixing types confuses it.
- Fusion needs a floor. A zone needs a baseline AUC ≈ 0.73 before extra layers help rather than hurt.
- Transfer is zone-specific. Satellite features encode geography, not transferable geology — each zone earns its own stack.
- The gap is data, not ML. Surface screening is near its ceiling; the next gains come from new public-access layers.
6. Honest scorecard
We grade the engine against a fixed four-dimension objective. This is the real number, not a marketing one:
| Dimension | Score | Status |
|---|---|---|
| Mineral identity | 4.0 / 10 | Satellite encodes geography, not mineralogy |
| Depth | 4.1 / 10 | Needs subsurface geophysics (gravity / AEM) — access-blocked |
| Coordinates | 7.0 / 10 | ~30 m resolution, ~1 km² zones |
| Certainty | 7.7 / 10 | Best calibrated Brier ≈ 0.096 |
| Total | 22.8 / 40 (57%) | Surface screening solid; depth is the frontier |
Path to the top. Satellite only caps near 55%. Adding public-access geophysics lifts it toward 75%; adding drill-hole data toward 90%. A true 100% needs a field campaign — which is exactly what a GeaSpirit report is designed to de-risk and prioritize before a single dollar is spent in the field.
7. Limitations
- Remote sensing and public data screen opportunity; only drilling confirms a deposit.
- Scores are prioritization indicators — not probabilities of economic mineralisation, and not valuations.
- Depth estimation is limited by access to subsurface geophysics, much of which sits behind manual portals.
- GeaSpirit provides intelligence and prioritization, never investment, legal or technical advice.
8. Where this is going
- Sellable screening reports — "at these coordinates, this mineral is probable, at this confidence, with this evidence."
- Global coverage — scale the public-access database toward every mine and prospect on Earth.
- Mining-asset marketplace — connect owners, explorers and investors (individuals & institutions) around evidence-scored assets and potential lands, with on-chain provenance via SOST.