An LLM is a generative function. Every input shapes the trajectory of the output. Mastery is shaping every input on purpose.
Imagine two clinically identical colic cases hit Florence in the same week. Same model. Same minion. Same prompt template. The first vet gets a sharp, ranch-specific recommendation grounded in last month’s lab work for that herd. The second vet gets a generic textbook answer that could have come from any veterinary forum. Same function, different output. Why?
Because the answer was never determined by the model alone. It was determined by every token that landed in the context window before the model produced its first output token — the system instructions, the retrieved chunks, the tool returns, the conversation so far, the embedded image, the threshold check that fired or didn’t. The model is a function of its inputs. The output is a trajectory through the space of possible token sequences, and that trajectory is bent — measurably, controllably — by every input the system feeds it.
Vibe coding is what happens when you ignore this: type a prompt, hope for the best, blame the model when the answer is bad. Trajectory engineering is the opposite discipline. It treats every input as a steering surface. V.E.T.S. has six of those surfaces wired into the platform, and a seventh that ties them together. Here is each one.
1. Foundations — the model is a function of its inputs
The base case. A Pr Prompt is not a question — it is a structured instruction that carries role, task, constraints, and context. The same medical query expressed two different ways enters the model as two different vectors and produces two different trajectories.
That vectorization is not metaphor. Every input token is converted into mathematical meaning through Em Embeddings — positions in a high-dimensional space where semantically similar text clusters together. Trajectory engineering starts here: if you understand that “painful belly” and “acute abdominal distress” embed to different neighborhoods, you understand why word choice in the prompt is not stylistic — it is steering.
The generative function itself is the Lg Large Language Model. It maps the assembled input context to a probability distribution over the next token, samples, appends, and repeats. Everything else on this page is a strategy for what to put in front of that function so the sampled trajectory ends where the practitioner needs it to end.
2. Retrieval — the biggest lever is what you put next to the prompt
The prompt is the smallest input. The largest is everything you inject around it. Vx Vector Search selects which knowledge chunks join the prompt in the context window — the chunks that sit closest in embedding space to the user’s query. The selection is the steering. A retriever that pulls the right protocol bends the trajectory toward the right answer; a retriever that pulls a near-miss bends it toward a plausible-sounding wrong one.
Those retrieved chunks are injected into the prompt through the Rg RAG pattern — Retrieval-Augmented Generation. RAG is not a model. It is a discipline for shaping the input so the model’s trajectory is anchored to specific, verifiable knowledge rather than the diffuse statistical average of its training data. In V.E.T.S., RAG is what makes Florence answer about this ranch’s herd rather than about cattle in general.
For the data that must be exact — a vaccination date, a current weight, a drug last administered — retrieval is the wrong tool. Fc Function Calling is the most reliable trajectory steering of all, because it is not probabilistic at all. The model emits a structured tool call, V.E.T.S. executes a stored procedure against the database, and the deterministic result re-enters the context as fresh tokens the model now treats as ground truth. Retrieval bends the trajectory; tool calls clamp it.
3. Modality — inputs are not just text
A wound photograph, an ultrasound still, a scanned vaccination card — each is an input that joins the context window alongside the prompt. Mm Multimodal inputs steer the trajectory in ways pure text cannot. The same prompt “is this healing normally?” produces wildly different trajectories depending on whether the model also sees the wound or has to imagine it. Trajectory engineering treats image inputs with the same intent as text inputs: chosen, framed, captioned.
4. Bounds — catching trajectories that drift
Even perfectly shaped inputs produce probabilistic outputs. Trajectory engineering installs bounds: Gr Guardrails are hard checks against the output before the practitioner ever sees it — dosage outside species limits, contraindicated drug combinations, retrieval confidence below threshold. A guardrail does not improve the trajectory; it refuses to ship a trajectory that landed in the wrong region.
Not every decision needs the heavy model. Sm Small Models are the cheap classifiers that filter, route, and pre-process inputs before the expensive trajectory begins — keyword scoring for minion handoffs, intent detection, fast similarity checks. This is trajectory engineering as economics: spend tokens on the trajectories that matter, not the ones a tiny model can settle in a millisecond.
All of the above runs on proven Fw Frameworks — the patterns and infrastructure that handle retries, structured output parsing, streaming, prompt assembly, and error propagation. Frameworks are not glamorous, but they are how a trajectory survives contact with production. A clever prompt that only works when the network is fast and the JSON is well-formed is not engineering.
5. Training-time engineering — bending the baseline
The previous beats shape inputs at query time. Trajectory engineering also operates at training time: adjust the function itself so its default trajectories are better. Ft Fine-tuning is the slow flywheel: every expert correction in V.E.T.S. becomes training signal, and the next generation of the model starts its trajectories from a slightly better baseline. The vet who corrects Florence today is shaping the trajectory of every query Florence handles next month.
Fine-tuning needs data. For rare conditions — the cases the field has not produced enough of to train on — Sy Synthetic Data fills the gap. Generated examples of uncommon presentations, edge-case workflows, adversarial inputs. The point is not to fake reality; it is to give the model trajectories to learn from before reality forces them.
6. Verification — probing trajectories before they ship, making them legible after
Before trusting a trajectory in production, attack it. Re Red-teaming deliberately probes for the inputs that produce the worst trajectories — prompt injections, hallucination triggers, contradictions across retrieved chunks, dosages right at the species boundary. Trajectory engineering is incomplete without a discipline for finding the trajectories you didn’t plan for.
Once a trajectory has shipped to the practitioner, it must be inspectable. In Interpretability means the vet can see why the answer says what it says: which chunks were retrieved, what their confidence scores were, what tool calls fired, where uncertainty remained. A trajectory you cannot inspect is a trajectory you cannot trust.
For the cases that resist one-shot answers — differential diagnoses, multi-step workups, contradictory evidence — the trajectory itself should be longer. Th Thinking Models are LLMs configured to generate explicit reasoning tokens before the final answer. Chain-of-thought is not a prompting trick — it is a longer, higher-resolution trajectory through the problem, and on hard problems it measurably outperforms the equivalent one-shot.
7. Ownership — who steers the trajectory end-to-end
Everything above is a steering surface. The question that remains is: who is steering? An Ag Agent is the entity that owns a trajectory from input to output — picks the prompt template, runs the retrieval, calls the tools, applies the guardrails, decides when the answer is done. In V.E.T.S., every minion is an agent. Every minion owns a class of trajectories.
Hard problems rarely fit one agent. Ma Multi-Agent systems coordinate multiple owned trajectories — a coordinator decomposes the request, specialists each run their own bounded trajectory, results are composed back. The discipline expands: you are no longer engineering one trajectory, you are engineering a graph of them, plus the handoffs between them.
And the long-arc question: how much of the trajectory should the system steer without asking? Au Autonomy is the dial. At zero, every step requires confirmation. At one, the system completes entire workflows unattended. Trajectory engineering is what makes turning that dial up safely possible — because the further you turn it, the more the quality of unobserved trajectories has to be defensible by construction, not by hope.
The thread
Six leverage surfaces and one owner. That is the working definition of trajectory engineering in V.E.T.S.: every input chosen on purpose, every bound installed on purpose, every default improved on purpose, every trajectory inspectable on purpose, owned by an agent whose scope is clear. The two colic answers we started with weren’t produced by a better model and a worse model. They were produced by a better-shaped context and a worse-shaped one. The model is the same. The engineering is the difference.
Every minion in V.E.T.S. is a managed trajectory. The next tab shows how they coordinate. See the Minion Hierarchy →