Examples and Report Walkthrough

This page walks through real ml_architecture sample artifacts and explains how to interpret the results.

Existing sample outputs:

  • crates/arxo-engine/src/metrics/ai_observability/ml_architecture/samples/fastapi-workflow-report.json
  • crates/arxo-engine/src/metrics/ai_observability/ml_architecture/samples/fastapi-workflow-report.md

Sample project and config:

  • crates/arxo-engine/src/metrics/ai_observability/ml_architecture/samples/fastapi-workflow/
  • crates/arxo-engine/src/metrics/ai_observability/ml_architecture/samples/ml-architecture-config.yaml

From your project directory, run Arxo with the path to your ML project and config:

arxo analyze \
  --path /path/to/your/ml-project \
  --config ml-architecture-config.yaml \
  --format json \
  --output report.json

In the report, start with the top-level scores:

  • ml_architecture.overall_score
  • ml_architecture.overall_score_extended

Use these as the top-level release-readiness signal before moving on to detector-level triage.
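
A minimal sketch of reading these two keys, assuming the JSON report can be flattened to a mapping from metric key to numeric value. That layout, the 0.7 threshold, and the example values are assumptions for illustration and are not taken from the Arxo report schema.

```python
import json

# Hypothetical release threshold; the source does not define one.
RELEASE_THRESHOLD = 0.7

TOP_LEVEL_KEYS = (
    "ml_architecture.overall_score",
    "ml_architecture.overall_score_extended",
)


def top_level_scores(metrics: dict) -> dict:
    """Pick the two top-level scores out of a flat key -> value mapping."""
    return {key: metrics.get(key) for key in TOP_LEVEL_KEYS}


def release_ready(metrics: dict, threshold: float = RELEASE_THRESHOLD) -> bool:
    """Return True only when overall_score is present and meets the threshold."""
    score = metrics.get("ml_architecture.overall_score")
    return score is not None and score >= threshold


# Illustrative values only, not copied from the sample report.
example = json.loads(
    '{"ml_architecture.overall_score": 0.62,'
    ' "ml_architecture.overall_score_extended": 0.58}'
)
print(top_level_scores(example), release_ready(example))
```

If the real report nests metrics differently, only the lookup inside these two functions would need to change.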

Detector scores fall into four groups:

  • Core architecture: skew, boundary, pipeline complexity, reproducibility, lineage.
  • Evaluation and controls: eval integrity, data validation, CI integration, fairness.
  • Runtime operations: serving maturity, drift monitoring, monitoring/alerting, staleness, serving ops.
  • Rollout safety: A/B testing, shadow/canary.
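
One way to summarize a report by these groups is to average the detector scores within each group. The sketch below assumes each detector yields a numeric score; the short detector names are hypothetical labels chosen for illustration, not confirmed Arxo metric keys.

```python
from statistics import mean

# Group names follow the list above; detector names are hypothetical labels.
GROUPS = {
    "core_architecture": [
        "skew", "boundary", "pipeline_complexity", "reproducibility", "lineage",
    ],
    "evaluation_and_controls": [
        "eval_integrity", "data_validation", "ci_integration", "fairness",
    ],
    "runtime_operations": [
        "serving_maturity", "drift_monitoring", "monitoring_alerting",
        "staleness", "serving_ops",
    ],
    "rollout_safety": ["ab_testing", "shadow_canary"],
}


def group_means(scores: dict) -> dict:
    """Average the available detector scores per group; skip empty groups."""
    summary = {}
    for group, detectors in GROUPS.items():
        present = [scores[name] for name in detectors if name in scores]
        if present:
            summary[group] = mean(present)
    return summary


# Illustrative scores only.
example_scores = {"skew": 0.4, "boundary": 0.6, "ab_testing": 0.9, "shadow_canary": 0.7}
summary = group_means(example_scores)
print(summary)
```

A plain mean hides a single very low detector, so a minimum per group may be a better summary when one weak control is enough to block a release.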

For evidence and structural context, check:

  • ml_architecture.gpu_file_count, ml_architecture.database_file_count, ml_architecture.env_config_file_count
  • evidence-heavy modules and repeated low-score patterns
  • ml_architecture.graph.* entries for structure context
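
The context keys above can be pulled out with a simple prefix match. This sketch again assumes a flat key-to-value mapping, which is an assumption about the report layout; the graph key in the example is hypothetical.

```python
from fnmatch import fnmatchcase

COUNT_KEYS = (
    "ml_architecture.gpu_file_count",
    "ml_architecture.database_file_count",
    "ml_architecture.env_config_file_count",
)


def context_entries(metrics: dict) -> dict:
    """Split out the file-count keys and every ml_architecture.graph.* entry."""
    counts = {key: metrics[key] for key in COUNT_KEYS if key in metrics}
    graph = {
        key: value
        for key, value in metrics.items()
        if fnmatchcase(key, "ml_architecture.graph.*")
    }
    return {"counts": counts, "graph": graph}


# Illustrative values only.
example_metrics = {
    "ml_architecture.gpu_file_count": 3,
    "ml_architecture.graph.node_count": 42,  # hypothetical graph key
    "ml_architecture.overall_score": 0.62,
}
ctx = context_entries(example_metrics)
print(ctx)
```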

Prioritize triage in this order:

  1. Low-score categories in high-centrality modules.
  2. Controls with production blast radius: skew, lineage, eval, serving, drift.
  3. Rollout hardening: A/B, canary, alerting, staleness operations.
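
The ordering above can be sketched as a sort key. The `Finding` record, the category labels, and both thresholds are hypothetical; Arxo may expose centrality and categories in a different shape.

```python
from dataclasses import dataclass

# Category labels mirror the list above; they are illustrative names.
PRODUCTION_CONTROLS = {"skew", "lineage", "eval", "serving", "drift"}
ROLLOUT_HARDENING = {"ab_testing", "canary", "alerting", "staleness"}

LOW_SCORE = 0.5        # hypothetical cutoff for a "low" score
HIGH_CENTRALITY = 0.7  # hypothetical cutoff for a "high-centrality" module


@dataclass
class Finding:
    module: str
    category: str
    score: float
    centrality: float


def tier(finding: Finding) -> int:
    """Map a finding to its triage tier (1 is handled first)."""
    if finding.score < LOW_SCORE and finding.centrality >= HIGH_CENTRALITY:
        return 1
    if finding.category in PRODUCTION_CONTROLS:
        return 2
    if finding.category in ROLLOUT_HARDENING:
        return 3
    return 4


def triage(findings: list) -> list:
    """Sort by tier, then by lowest score within a tier."""
    return sorted(findings, key=lambda f: (tier(f), f.score))


# Illustrative findings only.
findings = [
    Finding("rollout/flags.py", "canary", 0.3, 0.2),
    Finding("serving/api.py", "drift", 0.6, 0.4),
    Finding("features/build.py", "boundary", 0.2, 0.9),
]
ordered = [f.module for f in triage(findings)]
print(ordered)
```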

Common patterns and what to do next:

  1. If overall_score is low and the skew and boundary scores are weak, align train/serve interfaces first.
  2. If reproducibility and lineage remain low, prioritize immutable artifacts and dependency pinning.
  3. If serving, drift, or alerting scores are low, harden operational controls before increasing rollout velocity.
  4. Re-run after fixes and verify upward movement in overall_score and in the affected detectors.
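
The re-run check in the last step can be sketched as a comparison of two reports. This assumes both reports have been loaded as flat key-to-value mappings; the detector key `skew` and all values are hypothetical.

```python
def score_deltas(before: dict, after: dict, keys: list) -> dict:
    """Return after - before for every watched key present in both reports."""
    return {
        key: after[key] - before[key]
        for key in keys
        if key in before and key in after
    }


def improved(before: dict, after: dict, keys: list) -> bool:
    """True only if at least one key was compared and every compared key went up."""
    deltas = score_deltas(before, after, keys)
    return bool(deltas) and all(delta > 0 for delta in deltas.values())


# Illustrative values only; "skew" stands in for an affected detector.
before = {"ml_architecture.overall_score": 0.50, "skew": 0.25}
after = {"ml_architecture.overall_score": 0.75, "skew": 0.50}
watched = ["ml_architecture.overall_score", "skew"]

print(score_deltas(before, after, watched), improved(before, after, watched))
```

Watching the affected detectors alongside overall_score guards against the case where the top-level score rises for an unrelated reason while the targeted control stays flat.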