<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Preparation on Jacob Beedle</title>
        <link>https://jacobbeedle.com/tags/preparation/</link>
        <description>Recent content in Preparation on Jacob Beedle</description>
        <generator>Hugo -- gohugo.io</generator>
        <language>en-us</language>
        <lastBuildDate>Fri, 06 Feb 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://jacobbeedle.com/tags/preparation/index.xml" rel="self" type="application/rss+xml" /><item>
        <title>The Cream Puff Framework: Why Structure is the Only Way to Scale</title>
        <link>https://jacobbeedle.com/blog/cream-puff-framework/</link>
        <pubDate>Fri, 06 Feb 2026 00:00:00 +0000</pubDate>
        
        <guid>https://jacobbeedle.com/blog/cream-puff-framework/</guid>
        <description>&lt;p&gt;Most people think pastry is about sugar and butter. For me, it was always about engineering.&lt;/p&gt;
&lt;p&gt;This week, I&amp;rsquo;m sharing a framework I&amp;rsquo;ve been refining for months: a structured system prompt that forces AI into a disciplined, staged workflow. But to explain why it works, I need to tell you about cream puffs. Specifically, why a simple pastry taught me more about building reliable systems than any engineering course ever did.&lt;/p&gt;
&lt;p&gt;By the end of this post, you&amp;rsquo;ll understand why structure beats complexity, how to stop AI from running ahead of you, and you&amp;rsquo;ll have a downloadable tool to implement this yourself. I&amp;rsquo;m also including something new: an actual recipe, because sometimes the best way to understand a system is to taste what it produces. Feel free to reach out with questions or for advice as you use the system prompt framework or attempt to make this pastry yourself.&lt;/p&gt;
&lt;h2 id=&#34;the-physics-of-a-hollow-shell&#34;&gt;The Physics of a Hollow Shell
&lt;/h2&gt;&lt;p&gt;A cream puff is deceptively simple. The dough, called pâte à choux, is just water, butter, flour, and eggs. But in the oven, that simplicity becomes a liability.&lt;/p&gt;
&lt;p&gt;As the dough heats, the water turns to steam and tries to tear the pastry apart from the center. Without intervention, you get an inconsistent mess. Sometimes a beautiful ball, more often a soggy, collapsed pile that can&amp;rsquo;t hold any weight.&lt;/p&gt;
&lt;p&gt;For years, I made a Japanese-style cream puff that became one of the most requested items on our menu. The secret wasn&amp;rsquo;t a fancy ingredient. It was a French technique called craquelin.&lt;/p&gt;
&lt;p&gt;You make a thin disc of sugar, butter, and flour, basically a shortcrust cookie, and place it on top of each puff before baking. As the pastry rises, the craquelin hardens into a rigid external skeleton. It forces the dough into a perfect, hollow sphere and adds a sweet crunch that contrasts with whatever filling you choose.&lt;/p&gt;
&lt;p&gt;That structural layer is the only thing preventing collapse. It provides the support the dough needs to become a vessel.&lt;/p&gt;
&lt;h2 id=&#34;learning-to-read-the-heat&#34;&gt;Learning to Read the Heat
&lt;/h2&gt;&lt;p&gt;Making the dough itself requires a specific kind of attention that I didn&amp;rsquo;t fully appreciate until I tried teaching friends how to make it at home.&lt;/p&gt;
&lt;p&gt;The tricky moment comes when you add the eggs. You&amp;rsquo;ve just cooked flour, butter, and water together in a pot over a flame. The mixture is hot. The eggs are cold. If you add them while the dough is too hot, the eggs cook and you get a grainy, scrambled texture that won&amp;rsquo;t rise properly. If you wait too long, the dough cools and the eggs won&amp;rsquo;t incorporate.&lt;/p&gt;
&lt;p&gt;The balance is physical. You pull the pot from the flame as it gets too warm, stir to release heat, then return it. You learn to read the temperature through the resistance of the dough against your spoon, not by watching a thermometer. It&amp;rsquo;s the kind of knowledge that only comes from repetition and paying attention to what the ingredients are telling you.&lt;/p&gt;
&lt;p&gt;This is the part that&amp;rsquo;s hard to write in a recipe. &amp;ldquo;Add eggs when the dough has cooled slightly&amp;rdquo; doesn&amp;rsquo;t capture the judgment involved. You have to develop the feel for it.&lt;/p&gt;
&lt;h2 id=&#34;the-filling-changes-everything&#34;&gt;The Filling Changes Everything
&lt;/h2&gt;&lt;p&gt;Once you have a perfect shell, you can fill it with almost anything. This is where the creativity lives.&lt;/p&gt;
&lt;p&gt;Most cream puffs use pastry cream, a cooked custard made with whole eggs. It&amp;rsquo;s rich and traditional, but heavy. Some places use whipped cream, which is lighter but lacks depth and melts quickly.&lt;/p&gt;
&lt;p&gt;My version used a raspberry fluff: egg whites whipped with sugar and raspberry liqueur. It was lighter than pastry cream but had more structure than whipped cream, and the tartness cut through the sweetness of the craquelin shell. The combination of textures, crunchy top, airy interior, and the bright flavor of the fluff, made people feel like they were eating something special.&lt;/p&gt;
&lt;p&gt;But here&amp;rsquo;s the key: the filling only works because the shell holds it. Without that structural discipline, you can&amp;rsquo;t support anything interesting. The fluff would get lost in the doughy pastry. The contrast would disappear. You&amp;rsquo;d just have a mess on a plate.&lt;/p&gt;
&lt;p&gt;The shell provides control. The filling provides context.&lt;/p&gt;
&lt;h2 id=&#34;when-the-kitchen-runs-ahead-of-you&#34;&gt;When the Kitchen Runs Ahead of You
&lt;/h2&gt;&lt;p&gt;I see people making the same structural mistake in AI development every day.&lt;/p&gt;
&lt;p&gt;Think about the last time you asked an AI to help you build something. You started describing your idea, maybe got halfway through explaining the context, asked the model for its perspective, and suddenly the screen filled with code you didn&amp;rsquo;t ask for. Functions you don&amp;rsquo;t recognize. Dependencies you&amp;rsquo;ve never heard of. The model decided it knew what you wanted before you finished telling it.&lt;/p&gt;
&lt;p&gt;This is the digital equivalent of a cook who fires an entrée the moment a server walks toward the kitchen, before the order is even spoken. In a restaurant, that cook gets pulled aside. In AI development, this behavior is treated as a feature.&lt;/p&gt;
&lt;p&gt;The result is what some developers call &amp;ldquo;vibe code&amp;rdquo;: output that looks functional on the surface but was generated without a shared understanding of what you&amp;rsquo;re actually building. It might run. It might even do something close to what you wanted. But the moment you need to modify it or debug an edge case, you realize nobody, not even the model, fully understands how it works.&lt;/p&gt;
&lt;p&gt;This is a failure of structure, not intelligence. The model is capable of better work. It just wasn&amp;rsquo;t given the constraints to produce it.&lt;/p&gt;
&lt;h2 id=&#34;building-the-shell&#34;&gt;Building the Shell
&lt;/h2&gt;&lt;p&gt;The fix isn&amp;rsquo;t writing more detailed instructions. It&amp;rsquo;s changing how you provide instructions entirely.&lt;/p&gt;
&lt;p&gt;One of the most effective ways to use modern LLMs, and a technique most people don&amp;rsquo;t know exists, is to provide your structure as a standalone file. By uploading a system prompt rather than typing everything into the chat, you separate the rules from the context. You give the AI its craquelin: a structural layer that won&amp;rsquo;t buckle under the pressure of your project.&lt;/p&gt;
&lt;p&gt;This matters because of how these models process information. When you dump everything into a single prompt, the model has to simultaneously understand your constraints, absorb your context, and generate output. That&amp;rsquo;s cognitive overload. By separating the system (how to behave) from the project (what to build), you let the model focus on one thing at a time.&lt;/p&gt;
&lt;p&gt;The Cream Puff Framework I&amp;rsquo;m sharing this week enforces a staged workflow. It prevents the AI from jumping to implementation until you&amp;rsquo;ve agreed on a plan. The system operates in four modes:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;BRAINSTORM:&lt;/strong&gt; Explore the problem space. Identify scope, tools, and approaches. The model presents options and trade-offs. No code gets written here.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PLAN:&lt;/strong&gt; Create a roadmap. The AI outlines the steps in two phases: first a high-level summary for your approval, then detailed breakdowns only after you confirm the direction. This is the critical gate that prevents runaway development.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;IMPLEMENT:&lt;/strong&gt; Execute against the approved plan. Code gets written only after the structure is agreed upon. The model follows the roadmap rather than inventing its own path.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;DEBUG:&lt;/strong&gt; Rapid triage when things break. Focused problem-solving without tearing down the whole system.&lt;/p&gt;
&lt;p&gt;If you read last week&amp;rsquo;s post on the brigade system, this structure will feel familiar. Each mode mirrors a station in the kitchen: prep work, coordination, line execution, and service recovery. The difference is that this framework enforces that discipline automatically. You upload it once, and the AI follows the staging without you having to manage it manually. Or you can split this into multiple agents to run in parallel if you are already used to working with models and feel you can handle and orchestrate that workflow.&lt;/p&gt;
&lt;p&gt;The model declares its current mode at the start of every response. This constant check-in keeps the logic centered regardless of how much context you pour inside, similar to the &amp;ldquo;ticket recall&amp;rdquo; technique I covered last week where you restate the core instruction right before asking for output.&lt;/p&gt;
&lt;h2 id=&#34;starch-on-starch&#34;&gt;Starch on Starch
&lt;/h2&gt;&lt;p&gt;There&amp;rsquo;s a concept in cooking I call &amp;ldquo;starch on starch.&amp;rdquo; Layering too many heavy starches on top of each other creates a bad texture. Potatoes inside a cream puff would be weird. More dough doesn&amp;rsquo;t make a better vessel.&lt;/p&gt;
&lt;p&gt;The same principle applies to AI. Layering too many system instructions on top of each other creates context exhaustion. If you keep adding rules as you go, the model gets distracted. Eventually it forgets something important.&lt;/p&gt;
&lt;p&gt;This is why the framework is designed to be complete but minimal. The shell should be sturdy enough to hold whatever filling you bring, but light enough to leave room for your actual project. You provide the structure. The user brings the context. Adding more system on top of system just creates cognitive starch. However, if while using it you notice an issue or want your outputs a different way you can modify the original recipe and the next time to make the dish see how it improves.&lt;/p&gt;
&lt;h2 id=&#34;the-reheating-problem&#34;&gt;The Reheating Problem
&lt;/h2&gt;&lt;p&gt;One more lesson from the kitchen: even a perfect cream puff can be ruined at the last moment.&lt;/p&gt;
&lt;p&gt;In evening service, I had other cooks handling the reheating and filling. The instruction seems simple: warm the shells so they&amp;rsquo;re crisp, then fill and serve. But if you reheat at too high a temperature, you burn the craquelin. Too low, and the shell goes soggy. The margin for error is smaller than people expect.&lt;/p&gt;
&lt;p&gt;This is the implementation gap. You can have a perfect plan and still fail in execution if you don&amp;rsquo;t understand the tolerances. In AI terms, this is why the IMPLEMENT mode exists separately from PLAN. The handoff between &amp;ldquo;here&amp;rsquo;s what we&amp;rsquo;re building&amp;rdquo; and &amp;ldquo;here&amp;rsquo;s the actual code&amp;rdquo; is where most projects go wrong. The framework forces you to treat that transition with the care it deserves.&lt;/p&gt;
&lt;h2 id=&#34;the-downloadable-framework&#34;&gt;The Downloadable Framework
&lt;/h2&gt;&lt;p&gt;I&amp;rsquo;ve included the full Cream Puff Framework as a downloadable file with this post. It&amp;rsquo;s a structured system prompt you can upload to your preferred AI tool. You don&amp;rsquo;t need to modify a template or hope you&amp;rsquo;re prompting correctly. You upload the shell, then bring your own filling: whatever project you&amp;rsquo;re working on.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;m also including the actual cream puff recipe itself, with the craquelin technique and the raspberry fluff filling. It lives on my website alongside the framework, because sometimes the best way to understand structure is to build something with your hands.&lt;/p&gt;
&lt;p&gt;The goal isn&amp;rsquo;t to replace your judgment. It&amp;rsquo;s to give you a system that holds up under pressure so your judgment has room to operate.&lt;/p&gt;
&lt;p&gt;Smooth is fast. See you on the line.&lt;/p&gt;
&lt;p&gt;Jacob&lt;/p&gt;
</description>
        </item>
        <item>
        <title>Menu Planning: How to Scale Without Burning Your Budget</title>
        <link>https://jacobbeedle.com/blog/brigade-coding/</link>
        <pubDate>Fri, 30 Jan 2026 00:00:00 +0000</pubDate>
        
        <guid>https://jacobbeedle.com/blog/brigade-coding/</guid>
        <description>&lt;p&gt;Most restaurants aim to get a first dish on the table within five to ten minutes of the order being placed. Not because salads are the most important part of the meal, but because the kitchen needs breathing room. If every table expects their entrée immediately, the line gets slammed, tickets back up, and quality collapses. You serve the salad first so the sauté station has time to properly fire the steak.&lt;/p&gt;
&lt;p&gt;This sequencing is called menu flow, and it&amp;rsquo;s the difference between a kitchen that hums and one that burns. The same principle applies to AI. Most companies are building what I call &amp;ldquo;God Prompts,&amp;rdquo; massive blocks of instructions sent to expensive models, expecting complex results in a single shot. Like a table asking for all of the dishes to be served at the same time, the table has no space to hold all the plates and the kitchen gets backed-up. The system gets overwhelmed, and you pay premium prices for inconsistent results.&lt;/p&gt;
&lt;p&gt;The solution is prompt chaining: breaking complex tasks into sequential steps where the output of one model becomes part of the input for the next. Each step handles what it&amp;rsquo;s designed for. Nothing gets overwhelmed. And critically, you place your core instruction right before asking for outputs, not at the beginning just to become buried under layers of context. Just like a chef checks the ticket one final time before the plate leaves the pass.&lt;/p&gt;
&lt;h2 id=&#34;the-god-prompt-problem&#34;&gt;The God Prompt Problem
&lt;/h2&gt;&lt;p&gt;I recently worked on a consulting project involving a high-volume search platform. The goal was to take raw, often ambiguous user queries and break them down into four key areas:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Core User Intent:&lt;/strong&gt; What are they actually trying to do?&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Main Keywords:&lt;/strong&gt; What are the essential &amp;ldquo;ingredients&amp;rdquo; of the search?&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Areas of Ambiguity:&lt;/strong&gt; Where is the query or unclear?&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Potential Next Searches:&lt;/strong&gt; What is the next course in the user&amp;rsquo;s journey?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Early versions of this tool used a single, top-tier reasoning model to do everything. One prompt. One model. One shot. The results were okay, but the cost was astronomical. The system was rushing the work, trying to be fast without being smooth, and burning through budget on parts of the task that didn&amp;rsquo;t require that level of horsepower.&lt;/p&gt;
&lt;p&gt;The prompt itself was a monster. Thousands of tokens of instructions, examples, and context crammed into a single request. Every query, no matter how simple, triggered the full machinery. Someone searching for &amp;ldquo;pizza near me&amp;rdquo; got the same expensive treatment as someone typing an ambiguous multi-part research question. We were using a sledgehammer to hang picture frames.&lt;/p&gt;
&lt;p&gt;Worse, the model was trying to do everything at once. Understand intent, extract keywords, identify ambiguity, and suggest next steps, all in a single cognitive pass. Like an overwhelmed cook trying to work every station at once, a guarantee for burnt food. In AI, it&amp;rsquo;s a recipe for inconsistent output and ballooning costs.&lt;/p&gt;
&lt;p&gt;We decided to rebuild the workflow using menu planning.&lt;/p&gt;
&lt;h2 id=&#34;the-brigade-system&#34;&gt;The Brigade System
&lt;/h2&gt;&lt;p&gt;A successful restaurant survives on its food cost, which usually needs to hover around 20-30%. If you use high-cost ingredients for low-value side dishes, you lose your margin before you sell your first entrée. AI pricing works the same way. Every token has a cost, and every model has a price tier.&lt;/p&gt;
&lt;p&gt;The brigade system is how professional kitchens have organized labor for over a century. Each person owns a specific role. When we rebuilt our search tool, we mapped this structure onto our AI workflow with three tiers:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Head Chef (Reasoning Model):&lt;/strong&gt; Highest cost, highest judgment. The Head Chef doesn&amp;rsquo;t cook. They read the ticket, interpret any modifications, and create the plan. They understand that table four wants their steak medium-rare with sauce on the side and the allergy note means no butter on the vegetables. They translate the customer&amp;rsquo;s intent into clear instructions for the line and determine the firing order so dishes arrive together without overwhelming any single station. You pay premium prices for this interpretation and planning work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Cooks (General Generation Models):&lt;/strong&gt; Mid-range cost. These are your workhorses. Once the Head Chef has read the ticket and called the order, the cooks execute. They handle the bulk of the actual production: extracting keywords, drafting responses, doing the reliable station work that makes up most of any workflow. They don&amp;rsquo;t need to interpret the customer&amp;rsquo;s intent. That decision was already made. They just need to execute their station consistently.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Runners (Summarization Models):&lt;/strong&gt; Low cost, high speed. Runners don&amp;rsquo;t cook and they don&amp;rsquo;t plan. They move output from the kitchen to the table. In AI terms, they clean up final formatting, summarize results, and package everything for delivery. Fast, cheap, and focused on presentation rather than production.&lt;/p&gt;
&lt;p&gt;If you don&amp;rsquo;t match the task to the right role, you aren&amp;rsquo;t being thorough. You&amp;rsquo;re being wasteful.&lt;/p&gt;
&lt;h2 id=&#34;rebuilding-the-workflow&#34;&gt;Rebuilding the Workflow
&lt;/h2&gt;&lt;p&gt;Here&amp;rsquo;s how we applied the brigade system to our search platform. The critical insight was that each of our four components, intent, keywords, ambiguity, and next searches, became a separate output request. Each step&amp;rsquo;s output informed the inputs for the steps that followed. The chain built on itself.&lt;/p&gt;
&lt;h3 id=&#34;phase-1-reading-the-ticket&#34;&gt;Phase 1: Reading the Ticket
&lt;/h3&gt;&lt;p&gt;We used a reasoning model strictly for interpretation and planning. This model didn&amp;rsquo;t produce any of the final deliverables. It acted as the Head Chef reading the ticket.&lt;/p&gt;
&lt;p&gt;Given a raw user query, it analyzed what the user was actually hungry for. Information? A transaction? A specific location? It identified any &amp;ldquo;modifications to the order,&amp;rdquo; the nuances and constraints that would change how downstream steps should execute. And it produced a plan: which components needed the most attention, what context was most relevant, and how the pieces should flow together.&lt;/p&gt;
&lt;p&gt;This step required genuine judgment. The difference between &amp;ldquo;apple&amp;rdquo; as a fruit and &amp;ldquo;Apple&amp;rdquo; as a company changes everything downstream. So we paid the Head Chef price for this interpretation work. But only for this interpretation work.&lt;/p&gt;
&lt;h3 id=&#34;phase-2-setting-the-station&#34;&gt;Phase 2: Setting the Station
&lt;/h3&gt;&lt;p&gt;Instead of cramming everything into a single call, we used split prompting. This is the digital version of mise en place. Before service, a cook doesn&amp;rsquo;t just grab ingredients when they need them. They prep everything in advance. Sauces are reduced. Proteins are portioned. Garnishes are cut and waiting. When the ticket fires, the cook isn&amp;rsquo;t thinking about preparation. They&amp;rsquo;re thinking about execution.&lt;/p&gt;
&lt;p&gt;We did the same thing with our context window. We added turns with instructions, output format requirements, and the planning context from the thinking model. We prepped the station so that by the time the model had to fire the response, all the ingredients were already in place. No scrambling. No confusion. Just execution.&lt;/p&gt;
&lt;p&gt;This separation matters more than it might seem. When you dump everything into one prompt, the model has to simultaneously understand what you want, absorb the context, and generate the output. That&amp;rsquo;s cognitive overload. By splitting the prep from the cooking, we let the model focus on one thing at a time.&lt;/p&gt;
&lt;h3 id=&#34;phase-3-the-line-work&#34;&gt;Phase 3: The Line Work
&lt;/h3&gt;&lt;p&gt;Once the Head Chef&amp;rsquo;s plan was in place and context was loaded, we ran the four components as a sequence of separate calls. Keyword extraction came first, and its output became part of the input for intent classification. Intent informed the ambiguity analysis. All three fed into the next-search suggestions.&lt;/p&gt;
&lt;p&gt;Each call used a mid-tier cook model. These models didn&amp;rsquo;t need to understand the full picture. They received clear instructions from the planning phase and executed their specific station. The chain meant that early precision compounded. Good keyword extraction made intent classification easier, which made ambiguity detection sharper, which made next-search suggestions more relevant.&lt;/p&gt;
&lt;h3 id=&#34;phase-4-running-the-plates&#34;&gt;Phase 4: Running the Plates
&lt;/h3&gt;&lt;p&gt;Finally, a fast runner model packaged everything for delivery. It took the outputs from each station and formatted them into clean, consistent deliverables. No deep thinking required. Just presentation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Results:&lt;/strong&gt; By breaking the dish into components and using the brigade system, the models replicated human reasoning at a rate of over 90% across all four key areas. We didn&amp;rsquo;t lose quality by using cheaper models. We gained precision by giving each model a specific role they were designed to master.&lt;/p&gt;
&lt;p&gt;The math was simple but striking. Our reasoning model, the Head Chef, now handled maybe 15% of the total workload, all of it interpretation and planning. The rest was distributed across cheaper, faster models that didn&amp;rsquo;t need to think deeply. They just needed to execute consistently. Total cost per query dropped by nearly 60% while accuracy actually improved. Specialization beats generalization when you design the workflow correctly.&lt;/p&gt;
&lt;h2 id=&#34;separating-thinking-from-output&#34;&gt;Separating Thinking from Output
&lt;/h2&gt;&lt;p&gt;There&amp;rsquo;s another kitchen concept that proved essential: the difference between what happens in the kitchen and what arrives at the table.&lt;/p&gt;
&lt;p&gt;When the Head Chef reads a ticket and plans the firing order, they&amp;rsquo;re thinking through dependencies, timing, and potential problems. The cooks don&amp;rsquo;t need to hear all of that reasoning. They need clear instructions: &amp;ldquo;Fire two ribeyes medium-rare, hold the butter on the veg for table four.&amp;rdquo; The thinking informs the instruction, but the instruction is what gets executed.&lt;/p&gt;
&lt;p&gt;We built this same separation into our AI workflow. The reasoning model&amp;rsquo;s thinking process, its full chain of interpretation and planning, was captured separately from its output instructions. The cook models received only the clean instructions they needed to execute their station.&lt;/p&gt;
&lt;p&gt;But here&amp;rsquo;s why the separation matters beyond efficiency: human reviewers need that thinking. When something goes wrong, when a query gets misclassified or keywords come back wrong, the thinking trace is how you diagnose the problem. Was the intent interpretation off? Did the plan make sense but the execution fail? Without visibility into the Head Chef&amp;rsquo;s reasoning, you&amp;rsquo;re debugging blind.&lt;/p&gt;
&lt;p&gt;So we kept two channels: the thinking channel for human review and debugging, and the output channel for downstream model consumption. The kitchen gets clean tickets. The manager gets the full picture.&lt;/p&gt;
&lt;h2 id=&#34;future-possibilities&#34;&gt;Future Possibilities
&lt;/h2&gt;&lt;p&gt;One pattern we didn&amp;rsquo;t use on this project but is worth exploring: running a smaller model in parallel with a more complex process to improve overall quality. Imagine a prep cook who watches the line and flags potential problems before they become mistakes. A cheap monitoring model that reviews outputs in real-time and kicks edge cases back to the Head Chef for re-evaluation. The cost is minimal. The quality improvement could be significant. It&amp;rsquo;s the digital equivalent of having a sous chef taste every sauce before it leaves the station.&lt;/p&gt;
&lt;h2 id=&#34;stabbing-the-ticket&#34;&gt;Stabbing the Ticket
&lt;/h2&gt;&lt;p&gt;There&amp;rsquo;s one more technique worth mentioning, because it solved a problem that almost derailed the whole project.&lt;/p&gt;
&lt;p&gt;There is a documented phenomenon in AI called &amp;ldquo;lost in the middle.&amp;rdquo; If you give a model a massive amount of context, it often forgets the original instruction by the time it reaches the end of the prompt. The model gets distracted by all the data you&amp;rsquo;ve fed it and loses sight of what you actually asked for.&lt;/p&gt;
&lt;p&gt;A Google study confirmed that restating the user&amp;rsquo;s query right before you ask for the final output dramatically improves results.&lt;/p&gt;
&lt;p&gt;In the kitchen, we call this stabbing the ticket. A Chef reads the order when it first hits the kitchen to get the firing order started. But right before the plate leaves the pass, the Chef looks at the ticket one more time. They verify the steak is medium-rare. They confirm the sauce is on the side. They refocus on the intent right at the moment of delivery and if something is wrong they get a chance to fix it before the guest ever realizes there was a mistake.&lt;/p&gt;
&lt;p&gt;This is why prompt structure matters so much. Your core instruction belongs at the beginning and the end, right before you ask for output. The model should be reminded of your intent at the moment it matters most.&lt;/p&gt;
&lt;p&gt;In our workflow, we implemented this ticket recall at the end of every chain. We restated the core user intent just before the final summary. It&amp;rsquo;s a small addition. Maybe one hundred extra tokens. But it was the difference between inconsistent output and reliable precision.&lt;/p&gt;
&lt;h2 id=&#34;the-digital-chefs-playbook&#34;&gt;The Digital Chef&amp;rsquo;s Playbook
&lt;/h2&gt;&lt;p&gt;Scaling an AI system isn&amp;rsquo;t a challenge of raw power. It&amp;rsquo;s a challenge of discipline and orchestration. If you want to move from being a line cook of syntax to a Head Chef of automation, you have to stop looking for magic and start planning your menu.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Serve the salad first.&lt;/strong&gt; Don&amp;rsquo;t try to fire every station at once. Sequence your workflow so each step has room to execute properly. The thinking model plans. The cooks execute. The runners deliver. Nobody gets overwhelmed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Chain your outputs.&lt;/strong&gt; Each step should inform the next. Keyword extraction improves intent classification improves ambiguity detection. Let precision compound through the workflow.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Separate thinking from instructions.&lt;/strong&gt; Your reasoning model&amp;rsquo;s full thought process is valuable for debugging, but your execution models need clean, focused instructions. Keep both channels, but don&amp;rsquo;t cross the streams.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stab the ticket.&lt;/strong&gt; Place your core instruction right before asking for output. Refocusing attention at the end of the process is the difference between a mess and a masterpiece.&lt;/p&gt;
&lt;p&gt;AI is changing how we work, but the ancient rules of the kitchen still apply. Get your station ready. Plan your menu. And never serve a dish you haven&amp;rsquo;t tasted yourself.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;ll see you on the line.&lt;/p&gt;
</description>
        </item>
        <item>
        <title>Mise en Place: Why Your AI is Failing at 4:00 PM</title>
        <link>https://jacobbeedle.com/blog/mise-en-place/</link>
        <pubDate>Fri, 16 Jan 2026 00:00:00 +0000</pubDate>
        
        <guid>https://jacobbeedle.com/blog/mise-en-place/</guid>
        <description>&lt;p&gt;After 2 years in a professional kitchen, I found myself working at a restaurant that was much busier than I was prepared for. That first weekend, I was responsible for frying chicken, lamb, and plating for over 200 people. I was so overwhelmed that I was literally sobbing over the fryer while the tickets kept screaming. Nobody could help me. Everyone was pinned to their own station. I wanted to walk out, but I stayed.&lt;/p&gt;
&lt;p&gt;Later that night, as we were cleaning down the stainless steel, the head chef pulled me aside. He didn&amp;rsquo;t tell me to toughen up or stop crying. He said, &amp;ldquo;I respect you for staying. I cannot teach you to care and to not give up. But I can teach you the discipline needed to cook better, faster, and more efficiently.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;The lesson he eventually taught me had nothing to do with how fast I could move my hands. It was about mise en place.&lt;/p&gt;
&lt;p&gt;Mise en place is a French term meaning &amp;ldquo;everything in its place.&amp;rdquo; But that translation undersells it. Mise en place is a philosophy. It is the practice of having your shallots minced, your sauces strained, your tools positioned, and your mind prepared before the first guest ever walks through the door. It&amp;rsquo;s not just about being organized. It&amp;rsquo;s about understanding that the work you do before service determines whether service destroys you or whether you destroy service.&lt;/p&gt;
&lt;p&gt;Nothing feels worse than ending a 10 hour shift and reflecting on all your mistakes. Similarly nothing is better than knowing that for the last 10 hours you crushed everything thrown at you.&lt;/p&gt;
&lt;p&gt;That first weekend at the fryer, I wasn&amp;rsquo;t failing because I was slow. I was failing because I started the night already behind and tried to move too fast. My station wasn&amp;rsquo;t set. My mental checklist didn&amp;rsquo;t exist. I was improvising from the first ticket, and improvisation compounds. One mistake leads to two. Two leads to six. By 6:00 PM, you&amp;rsquo;re sobbing into a fryer and wondering why you ever thought this was a good career.&lt;/p&gt;
&lt;p&gt;In AI implementation, most teams are doing exactly what I did that weekend. They&amp;rsquo;re rushing to deployment without setting the station first. And they&amp;rsquo;re wondering why everything falls apart the moment real users show up.&lt;/p&gt;
&lt;h2 id=&#34;what-mise-en-place-actually-looks-like&#34;&gt;What Mise en Place Actually Looks Like
&lt;/h2&gt;&lt;p&gt;When you watch a professional cook work a busy station, it looks effortless. Pans move. Plates appear. Nothing burns. But what you&amp;rsquo;re actually watching is the result of three or four hours of invisible work that happened before you arrived.&lt;/p&gt;
&lt;p&gt;Mise en place has three layers, and most people only think about the first one.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The first layer is physical organization.&lt;/strong&gt; This is the obvious stuff. Your ingredients are prepped and portioned. Your tools are clean and within reach. Your station is wiped down. If you need clarified butter, it&amp;rsquo;s already in a squeeze bottle at arm&amp;rsquo;s length. If you need tongs, they&amp;rsquo;re always in the same spot. You never have to think about where anything is, because everything is where it belongs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The second layer is workflow design.&lt;/strong&gt; This is understanding the sequence of your work. If you&amp;rsquo;re cooking a steak that takes twelve minutes and a sauce that takes three, you don&amp;rsquo;t start them at the same time. You know which tasks can run in parallel and which ones need your full attention. You know what can sit for a minute and what needs to be served immediately. A good cook has already visualized the entire ticket before they touch a pan.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The third layer is mental preparation.&lt;/strong&gt; This is the part nobody talks about. Before service, you run through scenarios. What happens if I get three ribeye tickets at once? What if someone orders the fish special and I only have two portions left? What&amp;rsquo;s my backup if the grill goes down? You&amp;rsquo;re not hoping things go smoothly. You&amp;rsquo;re preparing for them to go sideways, so that when they do, you already have a plan.&lt;/p&gt;
&lt;p&gt;Most AI implementations skip layers two and three entirely. Teams spend weeks building the model, which is the equivalent of prepping ingredients. Then they throw it into production without ever thinking about workflow or failure scenarios. The model itself might be excellent. But excellence doesn&amp;rsquo;t matter if the system around it is chaotic.&lt;/p&gt;
&lt;h2 id=&#34;setting-your-station-for-ai&#34;&gt;Setting Your Station for AI
&lt;/h2&gt;&lt;p&gt;Let me translate these three layers into what they look like for AI and automation work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Physical organization in AI means information architecture.&lt;/strong&gt; In a professional kitchen, the person making salads shouldn&amp;rsquo;t be positioned right next to the deep fryer. The heat and moisture will wilt the greens, creating chaos and ruining the dish. Every station needs proper spacing and specific tools. We ignore this logic in system architecture constantly. Many teams place computationally expensive processes right after a user&amp;rsquo;s first interaction, which bottlenecks every simple operation that follows. You&amp;rsquo;re designing a system destined to fight itself. Your data sources, your processing layers, and your output channels all need their own clean space. They shouldn&amp;rsquo;t interfere with each other.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Workflow design in AI means prompt sequencing and timing.&lt;/strong&gt; When a cook gets a ticket with a steak, a sauce, and a vegetable, they don&amp;rsquo;t start all three at the same time. They work backwards from the plate. The steak takes twelve minutes. The sauce takes four. The vegetables take two. So you fire the steak first, then the sauce at eight minutes, then the vegetables at ten. Everything lands at the same time, hot and ready.&lt;/p&gt;
&lt;p&gt;The head chef&amp;rsquo;s job during service is to manage this timing across the entire kitchen. They call out &amp;ldquo;firing table 12&amp;rdquo; so the grill cook knows to start the steak. Three minutes later, they call &amp;ldquo;walking in the sides for 12&amp;rdquo; so the vegetable station knows to start plating. The chef isn&amp;rsquo;t doing the cooking. They&amp;rsquo;re orchestrating the sequence so that every component arrives at the pass together.&lt;/p&gt;
&lt;p&gt;This is exactly how complex prompts should be structured. When you give a model a complicated task, you need to tell it what to prioritize and in what order. If you dump every instruction at once with no hierarchy, the model has to guess what matters most. Sometimes it guesses right. Often it doesn&amp;rsquo;t. It&amp;rsquo;s like handing a cook six tickets and walking away without telling them which table has been waiting longest.&lt;/p&gt;
&lt;p&gt;Instead, chain your instructions. Tell the model what to figure out first, what depends on that answer, and what the final output should look like. Give it the &amp;ldquo;firing order.&amp;rdquo; The model works best when it understands the sequence, not just the ingredients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mental preparation in AI means anticipating edge cases and failure modes.&lt;/strong&gt; What happens when a user submits an input you didn&amp;rsquo;t expect? What happens when your API rate limit gets hit? What happens when the model hallucinates? If you haven&amp;rsquo;t thought through these scenarios before deployment, you&amp;rsquo;re going to be sobbing over the metaphorical fryer when they inevitably occur. The best teams I&amp;rsquo;ve worked with spend more time on &amp;ldquo;what could go wrong&amp;rdquo; as they do on &amp;ldquo;what should go right.&amp;rdquo;&lt;/p&gt;
&lt;h2 id=&#34;the-400-pm-problem&#34;&gt;The 4:00 PM Problem
&lt;/h2&gt;&lt;p&gt;In most restaurants, 4:00 PM is the danger zone. Lunch service is wrapping up. Dinner prep is supposed to be happening. But if lunch ran long or got chaotic, you&amp;rsquo;re now behind on dinner prep. And if you&amp;rsquo;re behind on dinner prep, dinner service will be a disaster. The mistakes compound.&lt;/p&gt;
&lt;p&gt;This is exactly what happens with AI deployments that skip mise en place. The initial launch might look fine. Early users are forgiving. Traffic is light. But as usage scales, as edge cases multiply, as the system gets stressed, all the shortcuts you took in the setup phase come back to haunt you. By the time you realize you&amp;rsquo;re in trouble, you&amp;rsquo;re already too deep in service to fix it.&lt;/p&gt;
&lt;p&gt;There is nothing that ruins a kitchen faster than contamination. Spill raw chicken on a cutting board and you have to stop everything, clean the boards, and throw away any food nearby. Contamination spreads instantly. The consequences are severe. Data hygiene works the same way. If you allow junk content into your flow, if you have polluted context windows or insecure data handling, you&amp;rsquo;re not just making a mistake. You&amp;rsquo;re creating toxic debt. Shortcuts and accidental complexities that quietly become permanent defects in your system. If your data foundation is a mess of mystery spreadsheets and zero documentation, adding AI doesn&amp;rsquo;t solve anything. It just automates the mess at scale.&lt;/p&gt;
&lt;p&gt;The most expensive dish in a restaurant isn&amp;rsquo;t the Wagyu. It&amp;rsquo;s the dish you have to cook twice. Rushing leads to mistakes, and doing something a second time is always slower than doing it right the first time.&lt;/p&gt;
&lt;h2 id=&#34;the-lesson-that-stuck&#34;&gt;The Lesson That Stuck
&lt;/h2&gt;&lt;p&gt;I think about that head chef often. He could have told me I wasn&amp;rsquo;t cut out for kitchens. He could have let me quit. Instead, he taught me that the feeling of drowning isn&amp;rsquo;t a sign that you&amp;rsquo;re bad at the work. It&amp;rsquo;s a sign that you haven&amp;rsquo;t prepared properly for the work.&lt;/p&gt;
&lt;p&gt;Mise en place isn&amp;rsquo;t about being naturally organized. It&amp;rsquo;s about accepting that chaos is the default state of any complex system, and that your job is to impose order before the chaos arrives. You don&amp;rsquo;t rise to the occasion. You fall to the level of your preparation.&lt;/p&gt;
&lt;p&gt;Whether you&amp;rsquo;re dicing shallots or deploying a language model, the principle doesn&amp;rsquo;t change. Get your station ready. Think through the workflow. Prepare for failure. Then, and only then, are you ready for service.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;ll see you on the line.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Jacob&lt;/em&gt;&lt;/p&gt;
</description>
        </item>
        
    </channel>
</rss>
