
Framework and Language Coverage

finetuning_architecture uses a hybrid approach: file discovery + framework anchors + targeted parsing + evidence scoring.
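As a rough illustration of the hybrid approach, each stage can be thought of as contributing weighted evidence per file, with a finding emitted only when the combined score clears a threshold. This is a minimal sketch; the class names, weights, and threshold are hypothetical, not the tool's actual API:

```python
from dataclasses import dataclass, field

@dataclass
class Evidence:
    source: str   # which stage produced it, e.g. "anchor", "parse", "pattern"
    weight: float

@dataclass
class FileReport:
    path: str
    evidence: list = field(default_factory=list)

    def score(self) -> float:
        return sum(e.weight for e in self.evidence)

    def is_finding(self, threshold: float = 1.0) -> bool:
        # A finding is emitted only when concrete evidence clears the bar.
        return self.score() >= threshold

report = FileReport("train.py")
report.evidence.append(Evidence("anchor", 0.6))  # e.g. a framework anchor matched
report.evidence.append(Evidence("parse", 0.5))   # targeted parsing confirmed it
print(report.is_finding())  # True (score 1.1 >= 1.0)
```

The key property this models is the last one listed below: no single weak signal produces a finding on its own.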

Code and config candidates:

  • .py, .ipynb, .ts, .js, .rs, .yml, .yaml, .json, .toml

Dataset-like files:

  • .jsonl, .json, .parquet, .csv, .arrow
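The extension lists above imply a simple classification step during file discovery. A minimal sketch (the `classify` helper is hypothetical; note that `.json` legitimately lands in both buckets):

```python
from pathlib import Path

CODE_CONFIG_EXTS = {".py", ".ipynb", ".ts", ".js", ".rs",
                    ".yml", ".yaml", ".json", ".toml"}
DATASET_EXTS = {".jsonl", ".json", ".parquet", ".csv", ".arrow"}

def classify(path: str) -> set[str]:
    """Return the candidate buckets a file falls into by extension."""
    ext = Path(path).suffix.lower()
    buckets = set()
    if ext in CODE_CONFIG_EXTS:
        buckets.add("code_config")
    if ext in DATASET_EXTS:
        buckets.add("dataset")
    return buckets

classify("configs/axolotl.yml")  # {'code_config'}
classify("data/train.jsonl")     # {'dataset'}
classify("params.json")          # {'code_config', 'dataset'}
```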

For dataset-content checks, scanning is intentionally bounded to improve speed on large repositories.
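Bounded scanning of this kind typically caps both bytes and lines read. A sketch under assumed limits (the function and its defaults are illustrative, not the tool's actual bounds):

```python
def scan_dataset_head(path, max_bytes=1_000_000, max_lines=200):
    """Read only a bounded prefix of a dataset-like file, so content
    checks stay fast even on multi-gigabyte files."""
    lines, read = [], 0
    with open(path, "rb") as f:
        for raw in f:
            read += len(raw)
            lines.append(raw.decode("utf-8", errors="replace").rstrip("\r\n"))
            if read >= max_bytes or len(lines) >= max_lines:
                break
    return lines
```

Reading in binary mode with `errors="replace"` keeps the scan robust against files that are not valid UTF-8.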

Framework and stack anchors include:

  • HuggingFace Trainer/Transformers
    • Trainer, TrainingArguments, Seq2SeqTrainer, from_pretrained
  • TRL training paths
    • SFTTrainer, PPOTrainer, DPOTrainer, ORPOTrainer, plus modern method hints (rft, grpo, rloo)
  • OpenAI fine-tuning jobs
    • openai.FineTuningJob, fine_tuning.jobs.create
  • Axolotl-style YAML config keys
    • base_model, adapter, datasets, chat_template, and related training controls
Detection strategy:

  • Pattern and semantic checks provide broad, fast coverage.
  • Targeted parsing is used on supported paths for stronger precision.
  • Findings are emitted only when concrete evidence is found.
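The framework anchors above can be approximated with simple pattern checks over source text. A minimal sketch (these patterns are illustrative; the tool's actual rules are richer):

```python
import re

# Illustrative anchor patterns keyed by framework/stack.
ANCHORS = {
    "hf_transformers": re.compile(
        r"\b(Trainer|TrainingArguments|Seq2SeqTrainer|from_pretrained)\b"),
    "trl": re.compile(
        r"\b(SFTTrainer|PPOTrainer|DPOTrainer|ORPOTrainer)\b"),
    "openai_ft": re.compile(
        r"openai\.FineTuningJob|fine_tuning\.jobs\.create"),
}

def match_anchors(source: str) -> set[str]:
    """Return which framework/stack anchors fire on a source string."""
    return {name for name, pattern in ANCHORS.items() if pattern.search(source)}

match_anchors("trainer = SFTTrainer(model=model, args=args)")  # {'trl'}
```

Word boundaries (`\b`) keep `SFTTrainer` from also firing the bare `Trainer` anchor, which is one reason pattern checks alone are fast but need targeted parsing for precision.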

This is architecture analysis, not runtime validation.

finetuning_architecture requires:

  • CallGraph
  • EffectIndex

The effect index is used for pipeline-effect diagnostics:

  • training_files_with_gpu_count
  • training_files_with_database_count
  • training_files_with_storage_count
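Diagnostics of this shape count distinct training files exhibiting each effect kind. A sketch assuming a hypothetical row format for effect-index entries (the real EffectIndex structure may differ):

```python
# Hypothetical shape for effect-index rows: (file_path, effect_kind).
effects = [
    ("train.py", "gpu"),
    ("train.py", "storage"),
    ("finetune.py", "gpu"),
    ("etl/load.py", "database"),  # not a training file, so not counted
]

def pipeline_effect_counts(effects, training_files):
    """Count distinct training files exhibiting each effect kind."""
    files_by_kind = {}
    for path, kind in effects:
        if path in training_files:
            files_by_kind.setdefault(kind, set()).add(path)
    return {f"training_files_with_{kind}_count": len(paths)
            for kind, paths in files_by_kind.items()}

pipeline_effect_counts(effects, {"train.py", "finetune.py"})
# {'training_files_with_gpu_count': 2, 'training_files_with_storage_count': 1}
```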
Known limitations:

  • Wrapper abstractions can hide true trainer/eval/checkpoint boundaries.
  • Local heuristics can overmatch helper names in utility files.
  • Dynamic metadata construction can hide lineage or provenance semantics.
  • Non-standard training stacks without known anchors may be under-detected.
  • Access-control and trust-surface checks are local-context based, so centralized wrappers may be missed.
Recommendations:

  1. Keep training/config conventions explicit and centralized.
  2. Prefer explicit eval/checkpoint/lineage declarations over implicit defaults.
  3. Standardize metadata fields (base model, dataset version, artifact checksums).
  4. Review findings in core training modules before broad CI enforcement.
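Standardized metadata (recommendation 3) can be as simple as one explicit record per training run. A sketch with hypothetical field names, assuming SHA-256 checksums for artifacts:

```python
import hashlib
import json

def artifact_checksum(path: str) -> str:
    """SHA-256 over file contents, chunked so large artifacts stream."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            digest.update(chunk)
    return digest.hexdigest()

def run_metadata(base_model: str, dataset_path: str, dataset_version: str) -> str:
    """Emit one explicit, centralized metadata record per training run."""
    record = {
        "base_model": base_model,
        "dataset_version": dataset_version,
        "dataset_checksum": artifact_checksum(dataset_path),
    }
    return json.dumps(record, sort_keys=True)
```

Records like this make lineage explicit rather than implicit, which is exactly the kind of evidence the analysis can pick up.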