Ten questions, four scenarios, one shortlist test. Written for Safety Managers, Quality Managers and Heads of Compliance who've sat through one too many vendor demos that look great until you read the contract.
The deepest architectural divide in aviation safety platforms is whether they treat records as connected nodes in one operational picture, or as isolated rows in per-module databases. The first lets a Safety Manager open an occurrence and see the related audit findings, the training competency that lapsed, and the corrective action in flight — without an integration project. The second produces a four-tool stack where every cross-module question becomes a manual reconciliation.
The verification test: ask the vendor to show you a finding from their QMS module that links back to the SMS occurrence that created it, and forward to the training record that closed the loop. If the demo requires a copy-paste between modules, you have a four-tool problem disguised as one tool.
The audit-replay scenario: in three years, the regulator opens an old occurrence and asks why your team classified it the way they did. If the AI gives a different answer on that re-run than it did at the time, you cannot defend the original decision — and your audit trail is broken.
The verification test:ask the vendor to re-run the AI classification on the same occurrence three times in front of you. If the output differs even slightly, the AI is not designed for an aviation safety audit — you would have a compliance problem you didn't know you were buying.
The right posture: AI calibrated against your enumerated taxonomy, outputs that are consistent and reviewable, every run replayable, every output citing the source records it referenced, and a human always validating before the result becomes part of the operational picture.
The aviation safety platform you choose holds your most sensitive operational data — the occurrences your crew reports, the audit findings, the training non-compliances, the regulator submissions. Three questions, three yes/no answers:
Weasel words on any of those three are red flags. "We have controls in place" is not the same as "enforced at the database." "Generally we don't" is not the same as "never."
ICAO-aligned should mean a platform that ships with the actual taxonomies — ICAO occurrence categories enumerated, the standard 5×5 risk matrix configurable to your organisation, the Annex 17 threat catalogue, the Annex 19 SMS pillar structure. It should NOT mean a marketing badge with no traceability into the data model.
The verification test:ask which Annexes they cite, in what current version, and how they handle amendments. A platform that handles ICAO Annex 17 amendments as a structural change-impact analysis (each amendment ripples through dependent requirements, policies, training records, and evidence) is a different product from one that put "Annex 17 ready" on a slide.
Ask for a written onboarding plan with weekly milestones and named deliverables. If the plan starts at "kickoff workshop" and ends at "go live" with three boxes in between, push back. A real plan looks like this:
If a vendor needs 12 months, ask what the other 10 months cover. Usually it's their data-model design happening on your time and budget.
A corrective action is not closed when implementation finishes. It is closed when a named verifier confirms — with evidence — that the action actually reduced the risk it was designed to address. EASA Part-ORO and FAA Part 5 both make effectiveness verification mandatory. Many platforms treat it as optional metadata.
The verification test:in the demo tenant, try to close a corrective action without the effectiveness verification step. The system should refuse. If it lets you through "just for the demo", the platform doesn't enforce the regulation it claims to support.
Three pricing patterns to scrutinise:
Healthier patterns: per-tenant flat, per-tail-aircraft, or per-employee with unlimited reports.
Six red flags to watch in vendor demos:
Exit options:ask for the data export contract clause BEFORE you sign — what format, what timeline, what cost. If the answer is "we'll work it out", walk away. The clause needs to be specific: JSON or CSV, complete tenant export, within 24-48h on request, at no marginal cost during contract or for 12 months after.
Run each shortlisted vendor through these four:
If a vendor passes all four, you have a shortlist of one.