Method

Retrieval, fine-tuning, RL: a field guide to adapting open models

Knowledge becomes retrieval, skill becomes fine-tuning, outcome becomes optimization. A practical map of how we turn the best open models into a profession for a specific business.

May 12, 2026/1 min read/Qvijin Research

We don't build the brain. The frontier open models are extraordinary general reasoners — our job is to turn one into a specialist that knows your business and is measured against your number. There are three levers, and most real systems use all three.

1. Knowledge → Retrieval (RAG)

The model doesn't know your policies, your part catalog, or last quarter's incident reports. Retrieval gives it access at answer time.

Ground responses in your documents, not the open web.
Update knowledge by updating the source, not by retraining.
Cite, so a human can verify the chain.

Retrieval is the fastest lever and the one most teams under-invest in.

2. Skill → Fine-tuning (LoRA)

Some things can't be retrieved — they have to be learned. The shape of your outputs, your formatting standards, the judgment calls your best operators make.

Low-rank adaptation lets us teach that behavior efficiently, without touching the base weights. We fine-tune for style and skill, not for facts — facts belong in retrieval.

3. Outcome → Optimization (RL)

The last lever is the hardest and the most valuable: optimize the model against the metric that actually matters.

Reinforcement learning drills the system toward a defined outcome — fewer escalations, higher yield, lower cost-per-unit — until the behavior is the result you wanted.

Putting it together

A production system is rarely one lever. It's retrieval for knowledge, fine-tuning for skill, and optimization for outcome — composed, evaluated, and run in our own operations before it's yours.

All research Book a consultation