① Model Selection
v6.1② Alert Input
Upload a .json array or .csv file (up to 25 alerts for UI; use pipeline.py for larger runs).
Verdict
—Run an investigation to see the verdict here.
Autonomous Tier-1 Triage · 1 Lead + 3 Parallel Subagents · MSCS 670 Agentic AI
Upload a .json array or .csv file (up to 25 alerts for UI; use pipeline.py for larger runs).
Run an investigation to see the verdict here.
Run an investigation to see the triage summary here.
One-sentence conclusion appears after investigation.
Alert metadata appears here after investigation.
Main drivers of the classification appear here after investigation.
Classification sensitivity. Alerts with malicious probability at or above the threshold are labelled Malicious; below are Benign.
The agent's full reasoning chain will appear here after investigation.
ATT&CK tactic & technique matrix appears here after investigation.
The agent's subagent trace will stream here during investigation.
Alerts you investigate will appear on this timeline.
Awaiting investigation…
Signal mix appears here after investigation.
Run an investigation to see the flow.
A step-by-step waterfall showing how the risk score was built will appear here.
Summary appears after investigation.
| Time | Alert ID | Event Type | Source IP | Destination | Verdict | Conf | Tokens |
|---|---|---|---|---|---|---|---|
| Loading incident history… | |||||||
150 Benign / 150 Malicious · Primary evaluation dataset. Two Deep Agent models compared.
10 Benign / 10 Malicious · Quick sanity test used to evaluate model candidates before committing to full benchmarks.
225 Benign / 225 Malicious · Previous-generation benchmark run on the classic ReAct architecture before migration to Deep Agent.
Pick a format for the alert currently loaded in the Verdict panel. Summary is a compact executive-style brief; Full includes the complete subagent trace and raw reasoning.
Run an investigation in the Investigate tab first to enable report generation.
No reports yet. Generate one from the card above.
An autonomous Tier-1 SOC analyst that classifies security alerts as Malicious or Benign.
Built on the LangChain deepagents framework — every alert goes through a lead orchestrator
plus three parallel subagents and a calibrated risk scorer. No pre-agent bypass.
task calls in a single turncheck_network_contextanalyze_log_patterncheck_ip_reputationquery_threat_intelligencegeolocate_and_check_travelgoogle/gemini-2.5-flashBest overall accuracy and cost-performance balance. Used as primary.
openai/gpt-4o-miniFallback for when Gemini is unavailable. Slower and slightly less accurate.