FieldValue
RFC ID005
TitleROSIE TQ Baseline: Reference Dataset and Product Archetype Standards
Version1.1.0
StatusDraft
FocusGAMP 5, tool validation, product archetypes, multi-repo, SVCV

1. Scope

This RFC provides the standard protocol and reference data required to qualify a ROSIE Engine. It also defines product archetypes, ensuring the ROSIE process adapts its validation rigor to the nature of the software product.


2. The Golden Repository (Reference Data)

The TQ baseline includes a “Golden Repository” containing:

2.1 Valid Patterns

100+ variations of @gxp- tags across five languages:

LanguageFile ExtensionsComment Styles
C#.cs//, /* */, ///
Java.java//, /* */, /** */
Python.py#, """
TypeScript.ts, .tsx//, /* */, /** */
Go.go//, /* */

2.2 Invalid Patterns

Purposely broken scenarios for negative testing:

  • Hashes with intentional corruption
  • Circular dependencies in trace graphs
  • Missing signatures
  • Duplicate IDs
  • Malformed YAML front matter

3. Product Archetypes and ASV

Not all GxP products are built equally. The ROSIE Engine must recognize the product archetype declared in the gxp-product.md manifest.

3.1 Archetype Definitions

ArchetypeExamplesValidation FocusEvidence Requirements
SaaS/CloudWeb apps, portalsMulti-tenancy, session authPQ screenshots, API logs
Embedded/EdgeDevice firmware, IoTMemory safety, hardware interopUnit test coverage, IQ hashes
Data/AIAnalytics, ML modelsData lineage, reproducibilityModel weights, dataset hashes
InfrastructurePlatform-as-a-serviceSecurity guardrails, networkingTerraform state, config drifts

3.2 ASV Profiles

ProfileEvidence LevelReview RigorUse Case
minimalLogs onlyAI reviewLow-risk utilities
standardLogs + screenshotsAI + spot human reviewMost products
rigorousFull evidence + videoAI + full human reviewHigh-risk/regulated

4. Self-Validating Continuous Validation (SVCV)

To achieve self-validating status, the ROSIE Engine must perform recursive verification of its own integrity during every CI/CD run.

4.1 Recursive Integrity Check

StepActionFailure Mode
1Tool self-test: Run a subset of the Golden Repository against the enginePipeline halted
2Engine checksum: Verify the engine binary/image hash against a known-good hash stored in the SoRPipeline halted
3Proceed: Only if steps 1-2 pass, scan production codeN/A

4.2 Automated Deviation Reporting

  • Auto-log: Any failure of the Hard-Gate (RFC-002) must automatically generate a deviation report in the SoR
  • Root cause tagging: The AI agent identifies which requirements (URS/FRS) are breached by the current code state

5. Qualification Protocols

5.1 Operational Qualification (OQ)

To pass OQ, a ROSIE Engine must achieve:

MetricTarget
Extraction recall100%
Extraction precision100%
Gate fidelity100%
Archetype awareness100%

5.2 Performance Qualification (PQ)

MetricTarget
ThroughputProcess 1,000-file repo in < 60 seconds
PersistenceSoR correctly archives evidence package
LatencyContinuous tracking of time-to-trace

6. Continuous Validation Lifecycle

StageActionValidation Artifact
CommitSync and semantic checkProposed trace graph
BuildEngine self-testTool health certificate
TestEvidence capture (RFC-003)Execution evidence (JSON)
MergeHard-Gate RRT issueAutomated release certificate