AI-augmented · Senior-led offensive security

AI flags what’s possible. Operators prove what’s real.

AI maps your attack surface at machine scale. Then senior operators reproduce, chain, and rank every finding by hand. You get proof and a prioritized fix list — never a scanner dump, never an unverified “AI critical.”

Brief a senior operator See a sample finding

100% human-reproduced before delivery· 24–72h to first validated finding

Vulnerabilities responsibly disclosed to

Google
Anthropic
American Express
Under Armour
Naver
Cursor
CodeRabbit
Mintlify
Imena & more

sid: t-a

GET /api/v1/exports/10412200 · yourst-a
GET /api/v1/exports/10413200 · yourst-a
GET /api/v1/exports/10414200 · yourst-a
GET /api/v1/exports/10415200 · yourst-a
GET /api/v1/exports/10416200 · yourst-a
GET /api/v1/exports/10417200 · yourst-a
GET /api/v1/exports/10418200 · yourst-a
GET /api/v1/exports/10419200t-b

IDOR · cross-tenant read · PROVEN

01 / The problem

A scanner dump isn’t a pentest. An unverified “AI critical” isn’t proof.

Scanners and autonomous agents produce volume — candidate “criticals,” confidence scores, false positives. That’s a backlog, not a verdict.

02 / Method

Two temperatures. One handoff.

The machine drafts at scale; the operator decides. That one handoff is the whole method.

Machine layer · recon at scale

Your attack surface is bigger than your findings list. The machine enumerates every candidate — the operator proves the few that are real.

Machine layer · AI

maps the attack surface
enumerates assets & endpoints
flags candidate weaknesses
scores its own confidence
runs continuously, at machine scale

Operator layer · human

reproduces every candidate by hand
builds the working exploit
chains findings to real impact
drops the false positives
ranks by business risk

Autonomous tools give you one layer — the machine. We give you two — the machine, plus a senior operator who proves its work. That operator is the guarantee the tools removed.

Verification queue

Candidate · surfaced by machineClassOperator verdict

GET /api/v2/export?id=… · IDORAuth Verified · High
SMTP header injection · low confidenceEmail Dropped — false positive
JWT alg confusion → admin forgeAuth Chained → Critical
/debug exposed in productionInfo Verified · Medium
Rate-limit “bypass” · replayLogic Dropped — false positive
Subdomain takeover · dangling CNAMEDNS Verified · High
Verbose stack trace on 500Info Dropped — low signal

7 candidates → 4 proven · 3 dropped

03 / Services

What we test.

Six practice areas, each run machine-plus-operator.

Web & API / AppSec

Full-depth testing of web apps and APIs: authentication and access control, business logic, injection, session handling — beyond any scanner’s reach.

machine + operator

Cloud & Infrastructure

AWS, GCP, Azure and hybrid estates: IAM paths, misconfigurations, lateral movement, privilege escalation — proven, not presumed.

machine + operator

External & Network

Your perimeter as an attacker sees it: exposed services, forgotten hosts, exploitable network paths.

machine + operator

Red Team & Adversary Simulation

Objective-driven campaigns that test detection and response, not just prevention.

operator-led

Mobile

iOS and Android: client-side storage, transport security, API trust, platform-specific abuse.

operator-led

LLM & AI Application Security

Prompt injection, unsafe agent tool-use, data-exfiltration paths, and the access controls around your model — tested by operators who build with these systems daily.

machine + operator

04 / Process

Six steps, scope to sign-off.

Senior-led throughout — and we re-test your fixes before anything is called resolved. Only step two is the machine’s.

Engagement phases in order, scope to sign-off. Step two is machine-run; every other step is operator-led.
#	Phase	Actor	What happens
01	Scope	operator	A senior operator scopes the engagement with you — targets, rules, goals.
02	Recon & map	AI	Our AI enumerates and maps your attack surface at machine scale.
03	Verify & exploit	operator	Operators reproduce each candidate by hand and build working exploits.
04	Impact analysis	operator	Findings are chained and ranked by real business impact — not raw CVSS.
05	Report	operator	Proof, reproduction steps, and a prioritized fix list — written by the operator who did the work.
06	Retest & sign-off	operator	Free fix re-test within 30 days. The engagement closes with a human sign-off.

05 / Proof

What a finding looks like when it’s real.

Finding PS-SAMPLE-01 Sev High · CVSS 8.1

IDOR in export endpoint, chained to cross-tenant data access.

Target:

Verification log

m 03:12 candidate surfaced by recon model — export endpoint accepts arbitrary object id
o 09:47 reproduced by hand — 3 steps, cross-tenant read confirmed
o 11:02 chained: id enumeration → bulk export → tenant data

Reproduction

POST /api/v2/export {"object_id":""}
swap object_id for another tenant’s id
response returns foreign-tenant records — full export

Impact

Any authenticated user could export another organization’s records. Ranked HIGH: direct data exposure, trivial to script, no privileged access required.

Fix

Enforce object-level authorization on every export path; verify tenant binding server-side. Re-tested and confirmed closed at step 06.

Verified · reproduced by hand Operator note — the scanner saw one exposed id; we proved it drained a tenant.

Want a finding like this on your stack — proven, not presumed?

Brief a senior operator

06 / Commitments

What we promise, and what we won’t.

of findings reproduced by a senior operator before delivery

24–72h

to first validated finding

0-day

free fix re-test on every engagement

Testing aligned with: OWASP ASVS · OWASP Top 10 · NIST SP 800-115 · PCI DSS Supports your SOC 2 & ISO 27001 evidence.

Built for teams in FinTech, SaaS, Healthcare, Crypto & Web3, and the public sector.

07 / Contact

Brief a senior operator.

Tell us what you’re building and what you need tested. Our team — not sales — replies within one business day.

A senior operator scopes the work with you — NDA on request.
A 30-minute call, no obligation and no sales script.
A clear quote and timeline before any testing begins.

hello@pentestshell.com · security@pentestshell.com