For SANS Find Evil 2026 judges
Sift Sentinel is our entry to the SANS Find Evil 2026 hackathon. It is an autonomous DFIR agent that examines a Windows disk image and reports persistence mechanisms with no human in the loop. This page is your map into the live submission. Everything below the fold links into a working artifact, not a screenshot.
Walk-through (3 minutes)
If you only have time for four clicks, here is the order that tells the whole story.
Required submission components
The 8 components Devpost requires, with their current status and where to find each one. SUCCESS means the artifact is live and judge-readable; STAGED means the source content exists and the assembly step is pending.
docs/submission/devpost-description.mddocs/submission/Accuracy at a glance
23 cases, 71 findings, zero hallucinations. The pipeline has been run end-to-end against 23 distinct Windows hosts across three channels (disk-only, dual-channel disk + memory, and memory-only), with the Critic catching every claim before commit.
Cases by ground-truth strength
Highest authority first. The single externally-validated case carries the strongest precision claim; owner-annotated cases are informed by community write-ups; memory-channel approvals are individually reviewed by the project owner. Disk-only auto-commits land when the Critic's deterministic rules accept every finding without retry.
| case | channel | status | findings | high-conf | memory |
|---|---|---|---|---|---|
| dfirmadness-001-desktop externally validated · DFIR Madness public answer key | disk | committed | 2 | 2 | 0 |
| srl-2018-wkstn-05 owner annotated · community writeups | disk | committed | 2 | 2 | 0 |
| srl-2018-wkstn-05 owner annotated · dual channel | disk + mem | committed | 5 | 3 | 3 |
| srl-2018-base-rd-05-memonly memory channel · approved 2026-05-03 | memory | approved | 4 | 3 | 2 |
| srl-2018-base-wkstn-03-memonly memory channel · approved 2026-05-03 | memory | approved | 7 | 0 | 3 |
| srl-2018-base-wkstn-06-memonly memory channel · approved 2026-05-03 | memory | approved | 3 | 0 | 1 |
| srl-2018-base-rd-03-memonly memory channel · approved 2026-05-03 (LARIAT framework + masquerading services) | memory | approved | 3 | 0 | 0 |
| srl-2018-base-wkstn-04-memonly memory channel · approved 2026-05-03 | memory | approved | 4 | 0 | 2 |
| srl-2018-base-rd-01-dual cross-channel correlation | disk + mem | committed | 4 | 2 | 2 |
| srl-2018-base-rd-02-dual cross-channel correlation | disk + mem | committed | 3 | 3 | 1 |
| srl-2018-base-file-dual cross-channel correlation | disk + mem | committed | 4 | 4 | 2 |
+ 12 more cases (disk-only and dual-channel) ending in SUCCESS, including base-dc (negative control, no positives to predict) and the rest of the SRL workstation set. Full per-case breakdown in the run viewer. | |||||
A second accuracy track ships continuously: the daily synthetic-workstation loop. Every night Haiku reads recent threat-intel news, plants synthetic versions of the tradecraft on a baseline image, and the pipeline scores against the planted manifest. That data is on the Today's run page and accumulates without further engineering effort.
What is the autonomy story
The pipeline ships at L2 (Guarded Execution): the agent self-corrects via deterministic Critic rules and a bounded retry budget, with humans gating the initial plan and final findings. Submission target is L3 (Exception-Based Autonomy): only Low-confidence findings or fail-fast events pause for human review.
The architecture page shows the autonomy climb in the chip row at the top: L1 assisted (shipped), L2 guarded (shipped, the current state), L3 exception (the goal for the submission cut on the bounded reference dataset).