concepts · tweet · 8 min
AI Prompting as Behavior Engineering
klöss · Jan 30, 2026
Let's get this out of the way now. You don't suck at prompting because you're dumb. You suck at it because you think prompting is a way of asking for something nicely instead of engineering a certain behavior.
If you're anything like most people I've seen struggle with AI, you've had this experience.
You type something reasonable. The output is mid. You rephrase. Still mid. You add more keywords. Somehow worse. Eventually you blame the model, close the tab, and go back to doing things manually.
I'm not here to tell you that you need better prompts.
I'm here to tell you that prompting isn't what you think it is. And until you understand that, you'll keep getting results that make you wonder if AI is even worth the time or hype.
What follows is everything I've learned the hard way.
Years of trial, failure, refinement, and watching people repeat the same mistakes over and over.
This will be comprehensive.
This isn't one of those articles you just scroll through and forget tbh. This is something you'll likely want to bookmark, take notes on, and actually implement,
Because the protocol at the end will rewire how you interact with AI permanently and probably help you substantially, but who knows.
If this stings a little, good. That means it's working.
Yes, you suck at promoting. Yes, you need better prompts.
And I know said you're not dumb. But really you are.
I lied. Sorry not sorry.
Just a temporarily lie though. It's a good thing we can fix you.
Let's begin.
Most prompts look like this:
"Make me a landing page for my startup."
That's not a prompt.
That's a wish tossed into the void, hoping the model somehow reads your fucking mind on all variables of a website.
Here's what people don't understand: A language model is not a genie. It's not trying to figure out what you secretly want.
It's a pattern-completion engine that takes your input and generates the most statistically probable output.
Read that again.
The most statistically probable output.
If your input is vague or shitty, the output will be generic, because generic is what's most probable when it comes to most people and specific vocabulary or direction is absent.
This isn't the model being dumb. This is the model being obedient to what you actually gave it, which was almost nothing. So in fact, you are the dumb one here. Again, sorry to offend your feelings but it's the truth.
A real prompt is a contract. It answers four non-negotiables:
Role: Who is the model role playing as? Task: What exactly must it accomplish? Constraints: What directions are followed? Output format: What does "done" look like?
If even one of these is missing, the model fills the gap with assumptions and you bet your ass it's going to guess wrong and act like a human because that's who provided the input of course.
And assumptions are where hallucinations are born.
You didn't get a bad output. You gave an incomplete contract to the AI. The model honored exactly what you asked for. You just didn't realize how little you gave it or asked for. Or how many things go into a website like colors, themes, buttons, layout, styles, typography, spacing, padding, radius, borders, etc.?
Had you mentioned any of these you probably would've gotten a much better result. Consider that for a moment.
How do you fix this? Before taking on a task, understand what constitutes a truly complete end result.
Here's where it gets truly uncomfortable.
Most people treat AI models like upgraded search engines. Type question, receive answer. But that mental model is catastrophically wrong, and it's also why your results stay mediocre no matter how many "prompt engineering tips or videos" you consume.
Different models are different specialists and skillsets, they're not upgrades of the same brain.
Think about it this way: You wouldn't give identical instructions to your executive assistant, your graphic designer, and your backend developer. So don't act shocked when their outputs differ. Each has different training, different strengths, different failure modes.
Models work the same way.
Some prefer structured natural language. Some need explicit step sequencing. Some collapse under verbose prompts. Some ignore constraints unless you repeat them. Some excel at analysis but are terrible at creativity. Some hallucinate confidently while others hedge risk.
But here's the mistake almost everyone makes.
They write one prompt, reuse it everywhere, and expect identical behavior. Then they blame "AI" as a monolithic thing when results vary wildly.
Prompt portability is a myth. Prompt adaptation is THE skill.
Systems thinking. The person who writes model-specific prompts will outperform the person with "better ideas" every single time. This isn't about being smarter. It's about understanding that the tool changes based on which tool you're actually using.
If you're not converting your approach or prompts per model, you're leaving quality on the table. And you probably don't even realize it.
How do you fix this issue?
Simple.
Learn what each model excels and struggles at.
Learn what each model's use cases are.
Learn what constitutes a complete use case.
Now, you're not going in blind.
I need to say something that might feel counterintuitive.
The reason your prompts fail isn't because you're asking for the wrong thing most of the time. It's because you're too afraid to ask for specifically what you want.
You hedge. You stay vague. You write "make it awesome" instead of defining what awesome even means to you. You don't use the vast vocabulary in your arsenal.
Why?
Because being specific feels risky. What if you specify the wrong thing? What if you constrain too much and miss something better? What if being explicit makes you look like you don't know what you're doing?
So you stay vague, thinking you're keeping options open.
But vagueness isn't flexibility. It's quitting. It's being lazy. You're handing the model a half bake idea and hoping it happens to land what you wanted.
Here's the truth about constraints: Constraints are not limitations. Constraints are instructions.
Think like a movie director and/or creative director.
You are the boss. Act like it.
When you say "never alter design system" or "always maintain the existing copywriting tone" or "never change the reference source subject," you're not restricting the model. You're informing it. You're giving it the same contextual awareness a human collaborator would need via directions.
Humans who work with you learn these constraints over time through feedback, observation, and accumulated context. Think of them like the knick knacks of working with people. Your chemistry in a way. Models don't get to have that luxury yet though for most. Every conversation starts at zero unless you explicitly load the relevant constraints into the prompt. This is a key point to understand.
As AI and context improves, the sky will be the limit.
A search engine like Google databases and queried the world's information. But it was limited in what it could truly do, simply report information. Whereas models of today are quickly accelerating towards infinite advanced context windows, that will be able to act on all information. The paradigm is shifting.
You must say what to preserve, what may evolve, and what must never change if you wish to excel.
Without it, you're not left with much.
Consistency from AI does not come memory. It comes from instruction.
The people who get exceptional results from AI aren't using secret techniques. They're just willing to be uncomfortably specific about what they actually want.
Once you understand this, you'll get better vastly better results.
The architecture of a model's mind, if we can even call it that, responds to true structure.
Not simply the structure of fancy formatting or elaborate JSON, but the structure of clear systems thinking. The model reflects the clarity of your input.
Unclear thinking in, unclear output out.
That's the whole game.
Over years of refinement, I've found that effective prompts share a specific architecture. This isn't arbitrary. It maps to how these models actually process and prioritize information.
Layer 1: Identity
Who is the model in this conversation? Not just "you are a helpful assistant," but a specific fine tuned role with specific skill expertise and specific perspectives.
You might have seen Claude Skills go viral.
That is accessing that same mindset.
"You are a senior product marketer who specializes in B2B SaaS positioning" triggers different skill and role pattern-matching than "you are an AI chat assistant."
The model doesn't "become" this identity. But it accesses different clusters of training data, different stylistic patterns, different skills of patterns, and different reasoning approaches.
The identity of your model or agent matters.
Miss this and you'll get abysmal results.
Layer 2: Context
What does the model need to know to do this task incredibly well? This could include contextual background information, prior decisions, constraints from earlier conversations, and anything that would be obvious to a human but invisible to the model.
Most people dump context like a middle schooler procrastinated essay due: "Here's everything about my app…" Then they wonder why the model forgets things, contradicts itself, and rewrites core logic half way through vibe coding their idea.
Context must be ordered, scoped, and labeled. The model does not "remember" context emotionally. It pattern-matches relevance. If you don't mark what are the rules, what is editable or deprecated, what is historical, and what is ongoing, it treats everything as equally optional. That will give you code nightmares.
And realistically that's on you.
Layer 3: Task
What specific action must be taken by the AI? Not "write something about X" but "produce a 500-word product description that emphasizes the time-saving benefits for busy executives."
The more pr