Open to senior endpoint roles

Endpoint engineer
at 1.8 million-device scale_

CrowdStrike Falcon SME for Amazon's 1.8-million-device Windows, macOS, and Linux corporate fleet. Served as Amazon's primary SME during the July 2024 Channel File 291 global outage — the biggest IT incident in industry history.

1.8M
Devices
3
OS families
16
Years in IT
CF291
Primary SME
About

The engineer the team calls when the platform misbehaves.

Sixteen years in IT. Seven at Amazon. One fleet the size of a small country.
Marco Bruschi

For seven years I've been the primary CrowdStrike Falcon SME for Amazon's 1.8-million-device corporate fleet — Windows, macOS, and Linux, all the way down to the kernel. I served as Amazon's primary SME during the July 2024 Channel File 291 global outage. On a quieter week I'm the engineer the team calls when the platform misbehaves at scale.

I write the runbook, ship the script that fixes the thing, then write the KB article so nobody has to page me again. Looking for a senior endpoint or client-platform role closer to product decisions — shipping tooling that keeps the fleet invisible to the people using it.

Experience

Sixteen years deep in fleet operations.

Seven at Amazon on one of the largest CrowdStrike Falcon deployments in existence. Four before that scaling up from help desk to Senior System Administrator at a health-tech company growing from 20 to 200+ users. Plus MINDBODY, AMETEK, and Cal Poly SLO along the way.

Amazon Web Services — Corporate Security · Endpoint Platform Team

Jan 2019 – Present · San Luis Obispo, CA

Primary SME for CrowdStrike Falcon operations across a 1.8-million-device corporate fleet. Triage, vendor engagement, module rollouts, incident response, production tooling.

Systems Engineer II L5 · Current
Apr 2024 – Present

Served as Amazon's primary SME during the July 2024 Channel File 291 global outage — the biggest IT incident in industry history. Primary SME for Falcon operations across the 1.8M-device fleet: sensor deployment and policy tuning, exclusions, custom IOA and IOC authoring, Spotlight vulnerability management, agent lifecycle, and cross-platform incident response. Partnered on rolling out five new Falcon modules at scale (F4IT, Firewall, Spotlight, Device Control, Installation Tokens). Shipped the Windows repair script that cut team escalations by 90%, and reduced high-CPU ticket intake by 90%+ through KB articles and runbook documentation that moved routine investigations to self-service.

Systems Engineer I L4
Mar 2020 – Apr 2024

Joined the Endpoint Platform team as a Falcon operator to learn the stack end-to-end. Earned CrowdStrike Certified Falcon Administrator (CCFA) to formalize the platform knowledge, then expanded into sensor deployment, policy tuning, exclusions, and vendor case management. Investigated fleet-impacting EDR regressions across Linux, macOS, and Windows — kernel-level performance issues, sensor conflicts with other security agents, and host-visibility bugs — coordinating fixes directly with CrowdStrike engineering.

IT Support Engineer II
Jan 2019 – Mar 2020

Tier-2 endpoint support for Amazon corporate users. Built the grounding in Amazon's corporate endpoint stack that led straight into the Endpoint Platform team.

California Polytechnic State University, San Luis Obispo · IT Operations Specialist Sep 2018 – Feb 2019

Front-line IT operations for Cal Poly SLO — enterprise environment before Amazon.

AMETEK · System Administrator Nov 2016 – Apr 2017

System administration and on-prem infrastructure at a global instrumentation manufacturer.

MINDBODY, Inc. · Senior Operations Center Specialist May 2014 – Dec 2015

Production infrastructure maintenance for a SaaS serving tens of thousands of businesses — OS patching, batch-job monitoring, hardware diagnostics, proactive health.

Conversio Health · Senior System Administrator Apr 2010 – May 2014

Started on the help desk and grew into the Senior System Administrator role over four years, ending with two engineers reporting in. Scaled the infrastructure 10× alongside the company — from 20 employees to 200+ — while keeping PCI DSS audits passing on a twice-yearly cadence. Shipped the company's first ticketing system, monitoring stack, desktop imaging for rapid new-hire provisioning, and a Microsoft Exchange → Office 365 migration that was its first production cloud workload. First role where I learned that good ops is invisible.

Selected work

Three years, four ships.

The moments over the last few years where the platform either broke badly or got meaningfully better. Deep-dive writeups coming soon.
Incident Response 2024

Channel File 291 Global Outage

Served as Amazon's primary SME during the worldwide Falcon Channel File 291 outage — the biggest IT incident in industry history. Supported triage across the fleet, authored the post-incident runbook for pausing channel-file updates, and filed the feature request that CrowdStrike shipped back into the product.

Amazon's primary SME · Fleet-wide triage · Product feedback shipped
Writeup coming soon
Platform Ownership 2020–2026

1.8M-Device Fleet Ownership

Primary SME for CrowdStrike Falcon across Amazon's Windows, macOS, and Linux corporate fleet. Triage, root-cause, policy, vendor escalation, module rollouts — the full surface area of EDR at a scale very few engineers get to touch.

1.8M devices · 3 OS families · 11+ CIDs
Writeup coming soon
Production Tooling 2025

Repair Script & FixAll Package

Shipped a Windows repair script and FixAll package that replaced a 6 GB legacy remediation payload with a consumable “Repair Lite” build. Integrated into the enterprise software distribution catalog and deployed across the Windows fleet.

−90% team escalations · 6 GB → consumable
Writeup coming soon
Cost Optimization 2025

Telemetry Pipeline: 3B → 30M Events/Day

Caught a telemetry script generating 3 billion daily events on macOS devices. Partnered with the data-ingestion team on a 99% reduction — 3 billion down to 30 million per day — materially cutting downstream cost.

99% reduction · Cross-team partnership
Writeup coming soon
Skills

What I actually use every day.

Nothing on this list is aspirational. If it's here I've shipped it in production at Amazon-scale.

Endpoint Platforms

  • CrowdStrike Falcon EDR
  • Falcon Spotlight
  • Falcon Device Control
  • Falcon Firewall
  • Falcon for IT
  • FortiDLP / Reveal

Operating Systems

  • Windows Server & Client
  • Windows on ARM
  • macOS
  • Ubuntu
  • Amazon Linux

Cloud & Infra

  • AWS Lambda
  • S3
  • SQS
  • CloudWatch
  • IAM
  • SSM

Deployment & Config

  • SCCM
  • JAMF
  • Tanium
  • Enterprise software catalogs

Languages

  • PowerShell
  • Bash
  • Python

Incident & Response

  • Root-cause analysis
  • Vendor engagement
  • Runbook authoring
  • Cross-team coordination
  • CrowdStrike RTR

Certifications

  • CrowdStrike CCFA
  • GIAC GISF