SOAR Automation · foyl Learn

01 SOAR Fundamentals REFERENCE The three pillars of modern SOC automation

The Three Pillars

Orchestration

Connect tools so they can talk to each other

Automation

Run repeatable tasks without analyst intervention

Response

Execute containment at machine speed

Alert Fatigue

A typical enterprise SOC receives 1,100+ alerts per day. Analyst bandwidth handles ~35%. The other 65% go uninvestigated.

Key Concept

SOAR does not replace analysts. It handles repeatable, time-sensitive, low-judgment tasks — freeing analysts for investigations that require human context.

Ficsit Inc. · Alert Velocity (Live)

184

alerts in the last 4 hours

Alert Breakdown by Source

Discussion

If each of these 184 alerts takes 4 minutes to manually triage, how many analyst-hours does that represent? Answer: ~12 hours — more than one analyst's full shift, before doing anything else.

SOAR Fundamentals — Instructor Notes

Stress that SOAR is a process multiplier, not a headcount replacement. The three pillars have different skill implications: Orchestration requires understanding tool APIs; Automation requires playbook logic and testing discipline; Response requires security judgment to design safe automated actions. Many orgs buy SOAR and only use the Notification layer because "what if automation makes a mistake?" The Ficsit incident is the worked example of scoping containment safely — isolate endpoint: yes, delete user account: no.

Discussion Exercise

At what point in an incident should a human take over from automation? The answer depends on blast radius: automated session revocation (reversible) is safe even at low confidence. Automated endpoint isolation (business-impacting) should require higher confidence. Ask students to propose confidence thresholds for each of the 8 containment actions run during IRON CHIMNEY.

02 Alert Enrichment Pipeline Foyl SOAR From raw alert to investigation-ready in seconds

Raw Alert — ALT-2024-9821

{
  "alert_id": "ALT-2024-9821",
  "source": "Foyl SIEM",
  "type": "AiTM Phishing Detected",
  "severity": "CRITICAL",
  "timestamp": "2024-11-14T09:48:33Z",
  "src_ip": "185.220.101.47",
  "user": "marcus.chen",
  "host": "RESEARCH-STATION-01"
}

Enrichment Output

Click "Enrich +" to step through the enrichment pipeline.

Alert Enrichment — Instructor Notes

A human analyst gathers this same context in 8–12 minutes. SOAR does it in 3.2 seconds, every time, with no fatigue. Teaching point: enrichment quality depends on intel quality. If TIP has no data on the attacker IP, the risk score drops and the playbook routes to "low confidence review" — not auto-contain. SOAR enrichment exposes gaps in your intel program: consistent "no TIP data" responses tell you your feeds aren't covering relevant threat actors.

Critical Thinking

Which enrichment step is most critical for the Ficsit incident? The IP reputation check. If VirusTotal returned "benign," the playbook routes to manual review — marcus.chen's sessions are NOT revoked automatically, and the attacker has more time. Ask: what does this mean for organizations that only use free VirusTotal tier (rate-limited lookups)?

03 Playbook Anatomy Foyl SOAR Triggers, conditions, actions, and notifications

Node Types

Trigger

Starts the playbook

Condition

YES/NO branch gate

Action

Calls an integration

Notification

Alerts without acting

Delay

Pauses execution

End

Closes the run

← Select a node type

Playbook Anatomy — Instructor Notes

The most common beginner mistake: over-automating the Response layer before Orchestration and Automation are reliable. Build confidence progressively — start with Notification-only playbooks, then add enrichment Actions, then soft containment (force MFA step-up), then hard containment (isolate endpoint). The Delay node is underused but important: it enables semi-automated approval flows where a human must confirm before a high-blast-radius action executes.

Design Exercise

Design a playbook for BEC-001 (CEO wire fraud). What's the trigger? What enrichment steps? At what point should human judgment be required? Hint: every Response action in BEC-001 (notifying the bank, reversing the wire) requires human authority SOAR cannot have. This playbook should end with a Notification, not an Action.

04 Phishing Response Playbook Foyl SOAR · PB-001 Step through IRON CHIMNEY — AiTM Phishing Response

● PB-001 Ready to run

Execution Log

Phishing Playbook — Instructor Notes

The key teaching moment is Step 4 — the 85% confidence threshold. This branch is not about whether the alert is real (SIEM already fired CRITICAL). It's about whether enrichment data is reliable enough to trigger automated containment without a human review. Step 9 (EDR corroboration before endpoint isolation) is equally important: by requiring an EDR corroboration, the playbook avoids isolating hosts based on email evidence alone, which has a much higher false positive rate.

Pause Point — Step 10

At Step 10 (Isolate Endpoint), pause and ask: "Who should be notified BEFORE this action runs in production?" Expected answers: the endpoint owner's manager, the SOC shift lead, IT on-call. What does PB-001 currently do? Notify #soc-critical AFTER the fact. Is that order correct? This surfaces the "notify before vs. after action" design pattern debate — a real governance question in enterprise SOAR deployments.

05 Automated Containment Actions Foyl SOAR · PB-001 What SOAR did automatically in 142 seconds — IRON CHIMNEY

8

Actions Executed

4

Tools Called

142s

Total Duration

P1

Case Created

Isolation Delay

Actions 7–8 ran 100+ seconds after actions 1–6. This reflects the EDR corroboration condition in PB-001: the playbook waited for a correlated EDR alert before authorizing endpoint isolation. Without corroboration, actions 7–8 would have been skipped entirely.

What SOAR Did Not Automate

Interview the user · decide whether to re-enable the account · notify legal/compliance · approve wire transfer reversal · determine data exfiltration scope. These require human judgment and organizational authority.

View EDR detection that triggered isolation →

Automated Containment — Instructor Notes

142 seconds vs. 20–40 minutes of manual response represents a 10–15x reduction in attacker dwell time. At 3am when an on-call analyst is woken from sleep, that estimate doubles. Ask: what single action has the highest risk of causing a business outage? Endpoint isolation — if RESEARCH-STATION-01 were running a time-sensitive financial process, isolation would interrupt it. This is why the EDR corroboration gate exists.

Class Exercise

Would you add any additional gates before endpoint isolation? Suggested: check if the host is within critical business hours window, check if a critical business process is running. How would you implement that check using the SOAR integration catalog?

06 Case Creation & Handoff Foyl SOAR · SIEM INV-2024-0087 created in 142 seconds — analyst picks up pre-enriched

INV-2024-0087

IRON CHIMNEY

CRITICAL OPEN P1 · 4h SLA

created2024-11-14 09:50:57

created bySOAR · PB-001

assignedpioneer

playbookPB-001

Key Entities

USER · marcus.chen HOST · RESEARCH-STATION-01 IP · 185.220.101.47 DOMAIN · invoices-ficsit.io

Open in SIEM Investigation →

Automated Case Timeline

Analyst Handoff

SOAR creates the case with full enrichment context and the playbook execution log already attached. The analyst who picks it up at 09:51 doesn't re-gather context — it's already there. They go straight to investigation: what data left the network?

Case Creation & Handoff — Instructor Notes

The handoff moment (09:51:10 — pioneer assigned) is a critical design point. A poorly designed SOAR case arrives with only raw alert data, forcing the analyst to repeat the first 3 minutes of context-gathering. A well-designed case arrives pre-enriched. This is the difference between SOAR as a force multiplier vs. SOAR as just a ticketing system. Emphasize what the analyst can do IMMEDIATELY upon assignment that they couldn't have done without SOAR: the attacker IP is already blocked, the endpoint is already isolated. They can focus entirely on the investigation question.

Class Exercise

What should manual investigation focus on after SOAR's containment? Expected: verify isolation is complete, check lateral movement to other hosts, identify data accessed on RESEARCH-STATION-01 during the 17-minute dwell window (09:31–09:48), determine if marcus.chen credentials accessed any other system.

07 Integration Architecture Foyl SOAR 12 connected tools — the nervous system of automated response

Ficsit Inc. Integrations

Integration Architecture

Each integration registers its actions (what SOAR can tell it to do) and triggers (what events it sends to SOAR). Playbooks wire triggers to action sequences. Any event in any tool can automatically trigger any action in any other tool — without custom code.

Integration Architecture — Instructor Notes

12 integrations is a mature deployment — most orgs start with 3–4 (SIEM, endpoint, Slack) and expand over time. The principle of least privilege applies to SOAR: it should only have the specific permissions it needs, not admin access to each tool. Ask why the VM integration has only 6 actions: it's write-constrained by design — SOAR creates tickets and updates status, but scanner credentials and scan schedules should not be accessible from the SOAR platform.

Extension Question

Which integration would you add to Ficsit's SOAR next? Good answers: CMDB (richer asset context), HR system (check if user is on leave before account actions), AWS/Azure (the 10.0.1.x cloud workloads have no SOAR integration today — a gap if an attacker reaches the cloud segment).

08 SOAR Metrics & ROI Foyl SOAR · Analytics Measuring the automation dividend — Ficsit 30-day view

Automation Rate

71%

of alerts handled without human triage

Industry avg: 48%

MTTR Reduction

−68%

Mean time to respond vs. pre-SOAR baseline

43 min → 14 min average

Playbook Runs

1,247

total this month

47 avg/day · 3 currently failing

Alert Volume

34,122

ingested alerts this month

↑ 12% vs. last month

Analyst Hours Saved

847h

estimated time reclaimed

~5.4 FTE-weeks this month

False Positive Rate

8.3%

of auto-contained alerts required rollback

Below 10% threshold — pass

Playbook Performance

Playbook	Runs	Success	Avg Duration	Last Run
PB-001 · AiTM Phishing Response	47	94%	142s	2024-11-14
PB-002 · MFA Fatigue Lockdown	23	96%	89s	2024-11-13
PB-003 · Endpoint Isolation	31	100%	34s	2024-11-14
PB-004 · BEC Wire Fraud Alert	8	88%	71s	2024-11-10
PB-005 · Vuln Critical Ticket	156	98%	12s	2024-11-14

What to Automate

Automate when: the action is deterministic, the blast radius of a mistake is recoverable, volume is too high for humans, and speed matters more than nuance. Automate: email quarantine, IP blocks, session revocation. Do not automate: wire transfer reversal, account deletion, public communications.

Automation Debt

Every playbook is a liability as well as an asset. Detection rules change, tools get updated, APIs deprecate. The 3 failing playbook runs this month represent unacknowledged automation debt. Budget 15–20% of initial build time per quarter for playbook maintenance.

ROI Exercise

If SOAR saves 847 analyst-hours/month and a SOC analyst costs $85/hour fully loaded, what is the monthly ROI? ($71,995/month) If the SOAR platform costs $120,000/year, what is the payback period? (Under 3 weeks of monthly savings.) But — is "847 hours saved" a direct headcount cost or an opportunity cost? How does the answer change your budget conversation with the CFO?

SOAR Metrics & ROI — Instructor Notes

Push students on the ROI assumptions. "847 hours saved" as a direct cost implies headcount would decrease — which it won't if the SOC is already staffed. Honest framing: "SOAR lets our 4-analyst team handle alert volume that would otherwise require 7 analysts, while reducing MTTR 68%." The 8.3% false positive rate is acceptable for session revocation (reversible) but would be catastrophic for endpoint isolation (dozens of incorrectly isolated hosts per month). Per-action false positive budgets are a mature SOAR governance concept worth introducing here.

Discussion

What would happen to the false positive rate if Ficsit reduced PB-001's confidence threshold from 85% to 70%? Expected: automation rate increases, MTTR decreases, false positive rate increases — likely past the acceptable threshold. This is the core SOAR tuning tradeoff: more automation = more mistakes. The right balance depends on the reversibility of each action and the business's risk tolerance.