Must readtop of pipeline
Loops are Replacing Prompts. Verification is About to Be Your Biggest Problem.
Arjun Iyer · article · ~7 min
A loop is not a cron job with better marketing. A cron runs a fixed script; a loop has a decision-maker in the body — a model that reads work state and chooses the next action. The engineering is everything you wrap around that decision so it converges on correct instead of wandering.
concepts
autoresearch: Remove Yourself as the Bottleneck
@karpathy · tweet
Remove yourself as the bottleneck, put in few tokens, huge amount happens. The cleanest one-line statement of what loop engineering is for — Karpathy's autoresearch loop as the existence proof.
concepts
Trust Layer for AI-Generated Office Files (Second AI Attack)
Nate B Jones · youtube
Hostile-reviewer prompt + 4-stage trust-layer workflow + Ralph-loop spec (Codex ⇄ Opus 4.7) — the canonical real-world instance of two-model QC producing one verified output.
resources
Inspect: Ramp Background Coding Agent (75% of Code)
@rahulgs · tweet
The bottleneck was not the model — it was the environment. Most of the work is making the codebase legible and the feedback fast and truthful. Inspect has the context and tools to prove its own work before results reach a human.
concepts
Loop Library
article · ~8 min
A useful loop specifies five things: trigger, action, proof, memory, and a stopping condition. Every entry pairs the prompt with a verify-and-stop note — the evidence that proves the work is done.
resourcesread
You Shouldn't Be Prompting Coding Agents Anymore — Design Loops
@steipete · tweet
The tweet that named the era: prompting is the old model. The new model is designing a system that does the prompting for you. 6.5M views in one week — the signal that a new discipline had found its name.
concepts
Loop Engineering
Addy Osmani · article · ~9 min
The shift: you held the tool the whole time for two years. Now you build a small system that pokes the agents instead of you. The maker/checker split is the highest-value structural move — the model that wrote the code is too generous grading its own homework.
concepts
Don't Build More AI Agents Until You Watch This
Nate B Jones · youtube · ~14 min
The strongest counterweight to agent-sprawl thinking: before you build another agent, ask if a better loop would do it. Orchestration over proliferation.
concepts
How To Approach Your AI Evals
Hamel Husain · youtube · ~5 min
Evals are the verification half of a loop — without them, a loop can only converge on what its feedback can see, which is nothing.
concepts
WTF Is a Loop? Part 2: The 15 Loops People Are Actually Using
Matt Van Horn · article
The concrete catalog that proves loop engineering is real adoption, not just a meme. 15 patterns grounded in actual usage — the fastest way to see what a loop looks like in the wild.
resources
Prompt Injection Vulnerabilities in AI Coding Assistants
article · ~23 min
Prompt injection should be treated as a first-class vulnerability requiring architectural-level security mitigations rather than simple filtering approaches, especially as AI agents gain more system-level privileges and tool access.
concepts🎯 methodology📤 shareread
How Coding Agents Work with LLMs
Simon Willison · article · ~6 min
Understanding that LLMs are stateless completion engines helps optimize coding agent interactions by leveraging token caching and avoiding modifications to earlier conversation content to control costs.
concepts🎯 methodology📤 share
AI Coding Agent Evaluation Skills Framework
Hamel Husain · article · ~3 min
Start with the eval-audit skill to diagnose your current evaluation setup, then use specific skills like error-analysis to categorize failures properly rather than lumping different error types into generic scores.
tools🎯 methodology📤 share
Claude Opus 4.8 AI Model Release
article · ~8 min
Opus 4.8's enhanced judgment and reliability in agentic tasks makes it suitable for autonomous workflows where models need to work unattended and catch their own mistakes.
tools🎯 methodology
Experience Internalization for Continual Learning LLMs
article · ~19 min
For sustainable continual learning in LLMs, use principle-level experience abstraction with step-wise injection and off-policy context-distillation on high-quality teacher trajectories to avoid the capability degradation that occurs with iterative on-policy methods.
concepts🎯 methodologyread
Software 3.0 and Agentic Programming Evolution
article · ~8 min
Programmers are becoming orchestrators of agents rather than code writers, requiring a shift from line-by-line coding to high-level task delegation and context management.
concepts🎯 methodology📤 shareread
Project Glasswing AI Vulnerability Discovery
article · ~13 min
AI-powered vulnerability discovery is now limited by patching speed rather than finding speed, representing a fundamental shift in cybersecurity where verification and disclosure processes become the bottleneck.
tools🎯 methodology📤 share
LLM Council Multi-Model Query System
article · ~2 min
Combining multiple LLMs with cross-evaluation can provide more robust answers than single-model queries, and anonymizing responses during peer review prevents bias in the ranking process.
tools🎯 methodology
Trust Layer for AI-Generated Office Files
Nate · article · ~3 min
Build the truth layer first before the polished output - create an inventory of sources, map claims to evidence, and use a two-model review process to catch errors that look correct but are fundamentally wrong.
concepts🎯 methodology📤 share
Enterprise AI Tool Cost Management Strategies
Simon Willison · article · ~2 min
Setting per-tool spending limits rather than total AI budgets allows companies to manage costs while maintaining access to multiple AI tools, with ~10% of engineer compensation being a viable benchmark for AI tool investment.
concepts📤 share
Matt Pocock's Production Agent Skills Library
Yash Thakker · article · ~10 min
These skills represent battle-tested workflows from a practicing engineer, offering a blueprint for moving beyond experimental AI coding to production-ready development practices with proper planning and safety guardrails.
tools🎯 methodology
Palantir's Forward Deployed Engineer Enterprise Model
MindStudio Team · article · ~8 min
Enterprise AI deployment requires embedding technical experts within client organizations because neither side alone has sufficient knowledge to successfully implement AI in production environments.
concepts🎯 methodology📤 share
LLM-as-a-Judge for Automated Model Evaluation
Karyna Naminas · article · ~7 min
LLM judges can replace expensive human evaluation for most AI output assessment tasks because RLHF-trained models have internalized human preferences and can recognize quality even when they can't perfectly generate it.
concepts🎯 methodology📤 share
LLM Judge Model Selection Framework 2026
NVJK Kartik · article · ~7 min
Choose LLM judges based on calibration against YOUR specific rubric rather than generic benchmarks, as judge model changes can silently break evaluation pipelines while maintaining misleading consistency scores.
tools🎯 methodology📤 share
Enterprise LLM Wiki Knowledge Management Pattern
article · ~15 min
Personal knowledge management patterns break at company scale not due to technical limitations but because they require dedicated human curation - enterprise versions must automate both ingestion and maintenance to succeed.
concepts🎯 methodology📤 share
LLM as Judge Pattern for Agent Safety
MindStudio Team · article · ~17 min
Using a second LLM to validate agent outputs catches contextual errors that static rules miss, making it essential for high-stakes workflows like automated emails, database updates, or financial transactions.
concepts🎯 methodology📤 share
Claude Code Memory System Architecture
orchestrator.dev · article · ~5 min
Configure CLAUDE.md properly and understand how auto memory, Memory Tool, context compaction, and subagent memory layers work together to eliminate the need to re-explain the same codebase details in every session.
tools🎯 methodology
AI-Native Company Operations and Workforce
Lenny Rachitsky · article · ~5 min
Companies can become AI-first by having leadership model AI usage, hosting internal prompt-sharing sessions, and designating AI operations specialists to help teams integrate AI tools effectively into their workflows.
concepts🎯 methodology📤 share
Agent Engineering Framework and Definition
Latent.Space · article · ~9 min
Since no one agrees on what constitutes an 'agent', focus on the six practical elements rather than debating definitions - this gives you a concrete framework for building and evaluating agentic systems.
concepts🎯 methodology📤 share
Forward Deployed Engineers in Enterprise AI
article · ~6 min
When evaluating AI vendor FDE services, focus on who pays the costs and whether the engagement builds internal capabilities - flat FDE effort across deployments signals dangerous vendor dependency rather than true capability transfer.
concepts🎯 methodology📤 share
Harness Engineering and Adversarial AI Architecture
Eric · article · ~8 min
For complex AI tasks, shift from perfecting prompts to designing adversarial agent architectures where a separate Evaluator agent provides external critique to drive iterative improvement and prevent generic outputs.
concepts🎯 methodology📤 share
AI Agent Memory Benchmarks and Architectures 2026
article · ~17 min
Memory is now a first-class architectural component with measurable performance gaps, enabling production-scale AI agents that maintain context and personalization across sessions rather than being stateless.
concepts🎯 methodology📤 share
AI-Native Business Model and Organizational Structure
Dan Shipper · article · ~13 min
AI enables lean, multifaceted businesses where employees can be generalists using AI-first workflows, allowing small teams to operate multiple business lines that compound off each other through a cycle of experimentation, documentation, building, and teaching.
concepts🎯 methodology📤 share
AI Agent Orchestration Patterns for Production
JobsByCulture · article · ~8 min
Only use multi-agent orchestration when you genuinely need it for context limits, specialization, or parallelism - otherwise stick with well-engineered single-agent systems that are simpler to build and debug.
concepts🎯 methodology📤 share
Anthropic Three-Agent AI Development Architecture
article · ~3 min
Separating the work-performing agent from the evaluation agent significantly improves output quality in long-running AI tasks, while structured handoffs prevent context amnesia that typically causes autonomous agents to fail.
concepts🎯 methodology📤 shareread
Agent-Native Software Architecture Paradigm
Dan Shipper · article · ~7 min
This architecture enables faster development and allows users to modify app behavior through natural language, democratizing software creation beyond traditional coding expertise.
concepts🎯 methodology📤 share
AI Agent Security and Prompt Injection Vulnerabilities
Airia Team · article · ~4 min
Secure agentic systems by mapping data access blast radius, implementing least privilege principles, and limiting agent permissions to only necessary data sources rather than broad organizational access.
concepts🎯 methodology📤 share
Claude Memory Architecture for Persistent Context
article · ~2 min
Build reliable coding agents by implementing structured memory layers that persist only relevant context, rather than carrying forward complete conversation history which causes context drift and failures.
concepts🎯 methodology📤 share
Generative UI for AI Agents
article · ~11 min
Moving beyond text-only chat interfaces to dynamic UI generation makes agent systems more transparent, trustworthy, and effective by exposing agent state and enabling structured interactions.
concepts🎯 methodology📤 shareread
AI Coding Loops vs Direct Prompting
Matt Van Horn · tweet · ~9 min
The future of AI-assisted coding isn't better prompts, but building automated systems that handle the prompting cycle, allowing engineers to work at a higher level of abstraction by writing the orchestration logic rather than the code itself.
concepts🎯 methodology📤 shareread
Agent Literacy: Claude vs Codex Interface Philosophy
Nate B Jones · youtube · ~14 min
Focus on developing 'agent literacy' - the skill of directing agents with clear context, permissions, goals, and success criteria - rather than picking sides in tool debates.
concepts🎯 methodology📤 share
AI Agent Loops vs Human-in-the-Loop
Greg Isenberg · youtube · ~13 min
Human-in-the-loop provides better control and cost-effectiveness for most developers, while fully autonomous AI loops may only be practical for those with unlimited access to AI models.
concepts🎯 methodology📤 shareread
KPMG-Anthropic Strategic AI Alliance for Enterprise
article · ~4 min
Enterprise AI adoption succeeds when AI is embedded directly into existing work platforms rather than separate tools, reducing friction from weeks of development to minutes for common tasks like building compliance agents.
concepts🎯 methodology
Compound Engineering 8-Step Framework Evolution
Kieran Klaassen · article · ~6 min
As AI becomes more capable at execution, human value shifts to the 'sandwich' approach - defining what's worth building at the start and ensuring the final product feels right at the end, while letting AI handle the technical implementation in between.
concepts🎯 methodology📤 share
Anthropic Acquires Stainless SDK Platform
article · ~1 min
The acquisition signals that AI agent capability is fundamentally limited by connectivity infrastructure, making SDK and tooling quality critical for AI platform adoption.
tools🎯 methodology
PwC Claude Enterprise AI Implementation Strategy
article · ~7 min
Enterprise AI adoption succeeds when focused on end-to-end task completion in high-accuracy domains rather than just pilots, with the biggest gains coming from agentic systems that let experienced professionals operate at unprecedented scale.
concepts🎯 methodology
LLM Knowledge Bases for Business Intelligence
article · ~8 min
Companies can reduce the 9.3 hours workers spend searching for information weekly by letting LLMs automatically organize and synthesize internal documents into living knowledge systems that self-update and cross-reference content.
concepts🎯 methodology📤 share
2026 AI Software Architecture Predictions
Dan Shipper 📧 · tweet · ~2 min
As AI reduces software development costs, the bottleneck shifts from engineering capacity to design quality and user experience, creating new opportunities for designers and AI-native developers.
concepts🎯 methodology📤 share
Forward Deployed Engineers in AI Companies
article · ~8 min
The surge in FDE hiring indicates AI companies are shifting focus from pure product development to enterprise deployment and integration, suggesting implementation challenges are a major bottleneck for AI adoption.
concepts🎯 methodology📤 share
Cortex methodology candidates→ /team/methodology
Auto-flagged as cortex-relevant. Drafts get composed when 3+ items cluster around a topic and pushed to core/methodology/_drafts/ for review.
Prompt Injection Vulnerabilities in AI Coding Assistants
article · ~23 min
Prompt injection should be treated as a first-class vulnerability requiring architectural-level security mitigations rather than simple filtering approaches, especially as AI agents gain more system-level privileges and tool access.
concepts🎯 methodology📤 shareread
How Coding Agents Work with LLMs
Simon Willison · article · ~6 min
Understanding that LLMs are stateless completion engines helps optimize coding agent interactions by leveraging token caching and avoiding modifications to earlier conversation content to control costs.
concepts🎯 methodology📤 share
AI Coding Agent Evaluation Skills Framework
Hamel Husain · article · ~3 min
Start with the eval-audit skill to diagnose your current evaluation setup, then use specific skills like error-analysis to categorize failures properly rather than lumping different error types into generic scores.
tools🎯 methodology📤 share
Claude Opus 4.8 AI Model Release
article · ~8 min
Opus 4.8's enhanced judgment and reliability in agentic tasks makes it suitable for autonomous workflows where models need to work unattended and catch their own mistakes.
tools🎯 methodology
Experience Internalization for Continual Learning LLMs
article · ~19 min
For sustainable continual learning in LLMs, use principle-level experience abstraction with step-wise injection and off-policy context-distillation on high-quality teacher trajectories to avoid the capability degradation that occurs with iterative on-policy methods.
concepts🎯 methodologyread
Software 3.0 and Agentic Programming Evolution
article · ~8 min
Programmers are becoming orchestrators of agents rather than code writers, requiring a shift from line-by-line coding to high-level task delegation and context management.
concepts🎯 methodology📤 shareread
Project Glasswing AI Vulnerability Discovery
article · ~13 min
AI-powered vulnerability discovery is now limited by patching speed rather than finding speed, representing a fundamental shift in cybersecurity where verification and disclosure processes become the bottleneck.
tools🎯 methodology📤 share
LLM Council Multi-Model Query System
article · ~2 min
Combining multiple LLMs with cross-evaluation can provide more robust answers than single-model queries, and anonymizing responses during peer review prevents bias in the ranking process.
tools🎯 methodology
Trust Layer for AI-Generated Office Files
Nate · article · ~3 min
Build the truth layer first before the polished output - create an inventory of sources, map claims to evidence, and use a two-model review process to catch errors that look correct but are fundamentally wrong.
concepts🎯 methodology📤 share
Matt Pocock's Production Agent Skills Library
Yash Thakker · article · ~10 min
These skills represent battle-tested workflows from a practicing engineer, offering a blueprint for moving beyond experimental AI coding to production-ready development practices with proper planning and safety guardrails.
tools🎯 methodology
Palantir's Forward Deployed Engineer Enterprise Model
MindStudio Team · article · ~8 min
Enterprise AI deployment requires embedding technical experts within client organizations because neither side alone has sufficient knowledge to successfully implement AI in production environments.
concepts🎯 methodology📤 share
LLM-as-a-Judge for Automated Model Evaluation
Karyna Naminas · article · ~7 min
LLM judges can replace expensive human evaluation for most AI output assessment tasks because RLHF-trained models have internalized human preferences and can recognize quality even when they can't perfectly generate it.
concepts🎯 methodology📤 share
LLM Judge Model Selection Framework 2026
NVJK Kartik · article · ~7 min
Choose LLM judges based on calibration against YOUR specific rubric rather than generic benchmarks, as judge model changes can silently break evaluation pipelines while maintaining misleading consistency scores.
tools🎯 methodology📤 share
Enterprise LLM Wiki Knowledge Management Pattern
article · ~15 min
Personal knowledge management patterns break at company scale not due to technical limitations but because they require dedicated human curation - enterprise versions must automate both ingestion and maintenance to succeed.
concepts🎯 methodology📤 share
LLM as Judge Pattern for Agent Safety
MindStudio Team · article · ~17 min
Using a second LLM to validate agent outputs catches contextual errors that static rules miss, making it essential for high-stakes workflows like automated emails, database updates, or financial transactions.
concepts🎯 methodology📤 share
Claude Code Memory System Architecture
orchestrator.dev · article · ~5 min
Configure CLAUDE.md properly and understand how auto memory, Memory Tool, context compaction, and subagent memory layers work together to eliminate the need to re-explain the same codebase details in every session.
tools🎯 methodology
AI-Native Company Operations and Workforce
Lenny Rachitsky · article · ~5 min
Companies can become AI-first by having leadership model AI usage, hosting internal prompt-sharing sessions, and designating AI operations specialists to help teams integrate AI tools effectively into their workflows.
concepts🎯 methodology📤 share
Agent Engineering Framework and Definition
Latent.Space · article · ~9 min
Since no one agrees on what constitutes an 'agent', focus on the six practical elements rather than debating definitions - this gives you a concrete framework for building and evaluating agentic systems.
concepts🎯 methodology📤 share
Forward Deployed Engineers in Enterprise AI
article · ~6 min
When evaluating AI vendor FDE services, focus on who pays the costs and whether the engagement builds internal capabilities - flat FDE effort across deployments signals dangerous vendor dependency rather than true capability transfer.
concepts🎯 methodology📤 share
Harness Engineering and Adversarial AI Architecture
Eric · article · ~8 min
For complex AI tasks, shift from perfecting prompts to designing adversarial agent architectures where a separate Evaluator agent provides external critique to drive iterative improvement and prevent generic outputs.
concepts🎯 methodology📤 share
AI Agent Memory Benchmarks and Architectures 2026
article · ~17 min
Memory is now a first-class architectural component with measurable performance gaps, enabling production-scale AI agents that maintain context and personalization across sessions rather than being stateless.
concepts🎯 methodology📤 share
AI-Native Business Model and Organizational Structure
Dan Shipper · article · ~13 min
AI enables lean, multifaceted businesses where employees can be generalists using AI-first workflows, allowing small teams to operate multiple business lines that compound off each other through a cycle of experimentation, documentation, building, and teaching.
concepts🎯 methodology📤 share
AI Agent Orchestration Patterns for Production
JobsByCulture · article · ~8 min
Only use multi-agent orchestration when you genuinely need it for context limits, specialization, or parallelism - otherwise stick with well-engineered single-agent systems that are simpler to build and debug.
concepts🎯 methodology📤 share
Anthropic Three-Agent AI Development Architecture
article · ~3 min
Separating the work-performing agent from the evaluation agent significantly improves output quality in long-running AI tasks, while structured handoffs prevent context amnesia that typically causes autonomous agents to fail.
concepts🎯 methodology📤 shareread
Agent-Native Software Architecture Paradigm
Dan Shipper · article · ~7 min
This architecture enables faster development and allows users to modify app behavior through natural language, democratizing software creation beyond traditional coding expertise.
concepts🎯 methodology📤 share
AI Agent Security and Prompt Injection Vulnerabilities
Airia Team · article · ~4 min
Secure agentic systems by mapping data access blast radius, implementing least privilege principles, and limiting agent permissions to only necessary data sources rather than broad organizational access.
concepts🎯 methodology📤 share
Claude Memory Architecture for Persistent Context
article · ~2 min
Build reliable coding agents by implementing structured memory layers that persist only relevant context, rather than carrying forward complete conversation history which causes context drift and failures.
concepts🎯 methodology📤 share
Generative UI for AI Agents
article · ~11 min
Moving beyond text-only chat interfaces to dynamic UI generation makes agent systems more transparent, trustworthy, and effective by exposing agent state and enabling structured interactions.
concepts🎯 methodology📤 shareread
AI Coding Loops vs Direct Prompting
Matt Van Horn · tweet · ~9 min
The future of AI-assisted coding isn't better prompts, but building automated systems that handle the prompting cycle, allowing engineers to work at a higher level of abstraction by writing the orchestration logic rather than the code itself.
concepts🎯 methodology📤 shareread
Agent Literacy: Claude vs Codex Interface Philosophy
Nate B Jones · youtube · ~14 min
Focus on developing 'agent literacy' - the skill of directing agents with clear context, permissions, goals, and success criteria - rather than picking sides in tool debates.
concepts🎯 methodology📤 share
AI Agent Loops vs Human-in-the-Loop
Greg Isenberg · youtube · ~13 min
Human-in-the-loop provides better control and cost-effectiveness for most developers, while fully autonomous AI loops may only be practical for those with unlimited access to AI models.
concepts🎯 methodology📤 shareread
KPMG-Anthropic Strategic AI Alliance for Enterprise
article · ~4 min
Enterprise AI adoption succeeds when AI is embedded directly into existing work platforms rather than separate tools, reducing friction from weeks of development to minutes for common tasks like building compliance agents.
concepts🎯 methodology
Compound Engineering 8-Step Framework Evolution
Kieran Klaassen · article · ~6 min
As AI becomes more capable at execution, human value shifts to the 'sandwich' approach - defining what's worth building at the start and ensuring the final product feels right at the end, while letting AI handle the technical implementation in between.
concepts🎯 methodology📤 share
Anthropic Acquires Stainless SDK Platform
article · ~1 min
The acquisition signals that AI agent capability is fundamentally limited by connectivity infrastructure, making SDK and tooling quality critical for AI platform adoption.
tools🎯 methodology
PwC Claude Enterprise AI Implementation Strategy
article · ~7 min
Enterprise AI adoption succeeds when focused on end-to-end task completion in high-accuracy domains rather than just pilots, with the biggest gains coming from agentic systems that let experienced professionals operate at unprecedented scale.
concepts🎯 methodology
LLM Knowledge Bases for Business Intelligence
article · ~8 min
Companies can reduce the 9.3 hours workers spend searching for information weekly by letting LLMs automatically organize and synthesize internal documents into living knowledge systems that self-update and cross-reference content.
concepts🎯 methodology📤 share
2026 AI Software Architecture Predictions
Dan Shipper 📧 · tweet · ~2 min
As AI reduces software development costs, the bottleneck shifts from engineering capacity to design quality and user experience, creating new opportunities for designers and AI-native developers.
concepts🎯 methodology📤 share
Forward Deployed Engineers in AI Companies
article · ~8 min
The surge in FDE hiring indicates AI companies are shifting focus from pure product development to enterprise deployment and integration, suggesting implementation challenges are a major bottleneck for AI adoption.
concepts🎯 methodology📤 share