Must readtop of pipeline
Loops are Replacing Prompts. Verification is About to Be Your Biggest Problem.
Arjun Iyer · article · ~7 min
A loop is not a cron job with better marketing. A cron runs a fixed script; a loop has a decision-maker in the body — a model that reads work state and chooses the next action. The engineering is everything you wrap around that decision so it converges on correct instead of wandering.
concepts
Skills: The Application Layer
Barry Zhang & Mahesh Murag (Anthropic) · video
Model = processor, runtime = OS, skills = application layer. Direct validation of the Abeto/Cortex skills-first investment.
resources
Loop Engineering
Addy Osmani · article · ~9 min
The shift: you held the tool the whole time for two years. Now you build a small system that pokes the agents instead of you. The maker/checker split is the highest-value structural move — the model that wrote the code is too generous grading its own homework.
concepts
Inspect: Ramp Background Coding Agent (75% of Code)
@rahulgs · tweet
The bottleneck was not the model — it was the environment. Most of the work is making the codebase legible and the feedback fast and truthful. Inspect has the context and tools to prove its own work before results reach a human.
concepts
You Shouldn't Be Prompting Coding Agents Anymore — Design Loops
@steipete · tweet
The tweet that named the era: prompting is the old model. The new model is designing a system that does the prompting for you. 6.5M views in one week — the signal that a new discipline had found its name.
concepts
How We Build Effective Agents
Barry Zhang (Anthropic) · video
Spoken-word distillation of Anthropic's "Building Effective Agents" with three crisp rules: don't build agents for everything, keep it simple, think like your agent.
resources
Software 3.0
Andrej Karpathy (interviewed by Stephanie Zhan) · video
Cleanest one-sitting articulation of why the build paradigm flipped. "December 2025 was the agentic phase transition." From the person who coined vibe coding.
resources
Engineers, DELETE the BASH Tool: Agentic Security
IndyDevDan · youtube · ~1 min
Unrestricted Bash in any Claude Code agent is a prompt-injection-to-prod-nuke pipeline. Required reading before Cortex/Hub agents touch any client environment.
resourcesread
How To Approach Your AI Evals
Hamel Husain · youtube · ~5 min
Evals are the verification half of a loop — without them, a loop can only converge on what its feedback can see, which is nothing.
concepts
autoresearch: Remove Yourself as the Bottleneck
@karpathy · tweet
Remove yourself as the bottleneck, put in few tokens, huge amount happens. The cleanest one-line statement of what loop engineering is for — Karpathy's autoresearch loop as the existence proof.
concepts
Trust Layer for AI-Generated Office Files (Second AI Attack)
Nate B Jones · youtube
Hostile-reviewer prompt + 4-stage trust-layer workflow + Ralph-loop spec (Codex ⇄ Opus 4.7) — the canonical real-world instance of two-model QC producing one verified output.
resources
Loop Library
article · ~8 min
A useful loop specifies five things: trigger, action, proof, memory, and a stopping condition. Every entry pairs the prompt with a verify-and-stop note — the evidence that proves the work is done.
resourcesread
Pinecone Just Demoted Vector Search. Here's the Knowledge Layer.
Nate B Jones · youtube · ~1 min
Frames the shift from vector-search-as-RAG to a 'knowledge layer' architecture — directly relevant to the Mini-Me compilation pipeline and Cortex methodology data layer.
resources
Claude on Vertex AI with the ADK
Ivan Nardini (Google Cloud) · video
GCP's 4-piece agent stack (ADK + MCP + Agent Engine + A2A) with Claude on Vertex. Reference baseline for any GCP-resident client conversation.
resources
Software Fundamentals Matter More Than Ever
Matt Pocock · video
Direct pushback on pure spec-to-code: Pocock ran the loop and code got worse each iteration. Counter-narrative ammo for PE conversations.
resources
Full Walkthrough: Workflow for AI Coding
Matt Pocock · video
The clearest external reference for "what good looks like" with skills-driven workflows. Directly maps to your Claude Code skills setup.
resources
Don't Build More AI Agents Until You Watch This
Nate B Jones · youtube · ~14 min
The strongest counterweight to agent-sprawl thinking: before you build another agent, ask if a better loop would do it. Orchestration over proliferation.
concepts
WTF Is a Loop? Part 2: The 15 Loops People Are Actually Using
Matt Van Horn · article
The concrete catalog that proves loop engineering is real adoption, not just a meme. 15 patterns grounded in actual usage — the fastest way to see what a loop looks like in the wild.
resources
Practical Claude Code Tips
Boris Cherny (Anthropic) · video
Codebase Q&A → memory files → `claude -p` SDK as Unix utility. The "start with Q&A for a day" onboarding pattern is the playbook for any new codebase.
resources
LLM-Managed Personal Knowledge Base System
Andrej Karpathy · tweet · ~3 min
Instead of manually curating knowledge bases, let LLMs automatically compile and maintain wikis from raw sources, then query against them - this scales better than manual curation and creates a self-improving research system.
concepts🎯 methodology📤 share
Prompt Injection Vulnerabilities in AI Coding Assistants
article · ~23 min
Prompt injection should be treated as a first-class vulnerability requiring architectural-level security mitigations rather than simple filtering approaches, especially as AI agents gain more system-level privileges and tool access.
concepts🎯 methodology📤 shareread
Software 3.0 and Agentic Programming Evolution
article · ~8 min
Programmers are becoming orchestrators of agents rather than code writers, requiring a shift from line-by-line coding to high-level task delegation and context management.
concepts🎯 methodology📤 shareread
Project Glasswing AI Vulnerability Discovery
article · ~13 min
AI-powered vulnerability discovery is now limited by patching speed rather than finding speed, representing a fundamental shift in cybersecurity where verification and disclosure processes become the bottleneck.
tools🎯 methodology📤 share
AI Coding Agent Evaluation Skills Framework
Hamel Husain · article · ~3 min
Start with the eval-audit skill to diagnose your current evaluation setup, then use specific skills like error-analysis to categorize failures properly rather than lumping different error types into generic scores.
tools🎯 methodology📤 share
Experience Internalization for Continual Learning LLMs
article · ~19 min
For sustainable continual learning in LLMs, use principle-level experience abstraction with step-wise injection and off-policy context-distillation on high-quality teacher trajectories to avoid the capability degradation that occurs with iterative on-policy methods.
concepts🎯 methodologyread
Claude Managed Agents Launch
Lance Martin · tweet · ~5 min
Use managed agent infrastructure for production AI agents to avoid the overhead of maintaining custom harnesses and infrastructure while enabling long-running tasks that can execute over days or weeks.
tools🎯 methodology
AI Output Evolution: Text to Interactive Visual Media
Andrej Karpathy · tweet · ~2 min
Ask LLMs to format responses as HTML instead of markdown to leverage humans' superior visual processing capabilities and get more engaging, easier-to-consume outputs.
concepts🎯 methodology📤 share
Opus 4.7 Productivity Tips from Boris Cherny
article · ~3 min
Auto mode and proper verification patterns are crucial for running multiple Claude instances in parallel and ensuring reliable output from long-running tasks.
tools🎯 methodology
Processing failed
Kevin Gu · tweet · ~4 min
Review manually.
tools
Multi-Agent AI Systems Architecture and Performance
article · ~18 min
Multi-agent systems excel by distributing work across separate context windows for parallel reasoning, with performance primarily driven by total token usage rather than individual agent intelligence.
concepts🎯 methodology📤 share
Context Engineering vs Prompt Engineering
article · ~16 min
Treat context as a finite resource with diminishing returns and actively curate what information gets included in each inference cycle, rather than just focusing on writing better prompts.
concepts🎯 methodology📤 share
How Coding Agents Work with LLMs
Simon Willison · article · ~6 min
Understanding that LLMs are stateless completion engines helps optimize coding agent interactions by leveraging token caching and avoiding modifications to earlier conversation content to control costs.
concepts🎯 methodology📤 share
Fine-tuning Agents with Reverse-Engineered Training Data
article · ~8 min
When building agents without interaction data, start with output artifacts users already produce and reverse-engineer the training data—this creates realistic training examples before you have production metrics.
concepts🎯 methodology📤 share
Authorization Propagation in Multi-Agent AI Systems
article · ~7 min
Identity governance must be treated as infrastructure in multi-agent systems - evaluated continuously and enforced at every interaction boundary before orchestration logic scales, as ordinary system behavior already produces authorization failures.
concepts🎯 methodology📤 share
Claude Opus 4.8 AI Model Release
article · ~8 min
Opus 4.8's enhanced judgment and reliability in agentic tasks makes it suitable for autonomous workflows where models need to work unattended and catch their own mistakes.
tools🎯 methodology
AI Job Impact: Production vs Judgment Tasks
Zack Shapiro · tweet · ~8 min
Focus on developing judgment and contextual decision-making skills rather than production tasks, as AI will make strategic thinking more valuable while commoditizing routine skilled work.
concepts📤 share
Claude Financial Services AI Agent Framework
article · ~7 min
Provides ready-to-deploy financial AI agents that can be customized for firm-specific workflows while maintaining human oversight requirements for compliance and regulatory approval.
tools🎯 methodology
LLM Council Multi-Model Query System
article · ~2 min
Combining multiple LLMs with cross-evaluation can provide more robust answers than single-model queries, and anonymizing responses during peer review prevents bias in the ranking process.
tools🎯 methodology
Trust Layer for AI-Generated Office Files
Nate · article · ~3 min
Build the truth layer first before the polished output - create an inventory of sources, map claims to evidence, and use a two-model review process to catch errors that look correct but are fundamentally wrong.
concepts🎯 methodology📤 share
Enterprise AI Tool Cost Management Strategies
Simon Willison · article · ~2 min
Setting per-tool spending limits rather than total AI budgets allows companies to manage costs while maintaining access to multiple AI tools, with ~10% of engineer compensation being a viable benchmark for AI tool investment.
concepts📤 share
Matt Pocock's Production Agent Skills Library
Yash Thakker · article · ~10 min
These skills represent battle-tested workflows from a practicing engineer, offering a blueprint for moving beyond experimental AI coding to production-ready development practices with proper planning and safety guardrails.
tools🎯 methodology
Claude for Legal Practice Workflows
Zack Shapiro · tweet · ~8 min
General-purpose AI tools like Claude can outperform specialized legal AI products because legal advantage comes from professional judgment in applying tools, not from having firm-specific templates or clause libraries.
tools🎯 methodology📤 share
Palantir's Forward Deployed Engineer Enterprise Model
MindStudio Team · article · ~8 min
Enterprise AI deployment requires embedding technical experts within client organizations because neither side alone has sufficient knowledge to successfully implement AI in production environments.
concepts🎯 methodology📤 share
LLM-as-a-Judge for Automated Model Evaluation
Karyna Naminas · article · ~7 min
LLM judges can replace expensive human evaluation for most AI output assessment tasks because RLHF-trained models have internalized human preferences and can recognize quality even when they can't perfectly generate it.
concepts🎯 methodology📤 share
LLM Judge Model Selection Framework 2026
NVJK Kartik · article · ~7 min
Choose LLM judges based on calibration against YOUR specific rubric rather than generic benchmarks, as judge model changes can silently break evaluation pipelines while maintaining misleading consistency scores.
tools🎯 methodology📤 share
Enterprise LLM Wiki Knowledge Management Pattern
article · ~15 min
Personal knowledge management patterns break at company scale not due to technical limitations but because they require dedicated human curation - enterprise versions must automate both ingestion and maintenance to succeed.
concepts🎯 methodology📤 share
LLM as Judge Pattern for Agent Safety
MindStudio Team · article · ~17 min
Using a second LLM to validate agent outputs catches contextual errors that static rules miss, making it essential for high-stakes workflows like automated emails, database updates, or financial transactions.
concepts🎯 methodology📤 share
Claude Code Memory System Architecture
orchestrator.dev · article · ~5 min
Configure CLAUDE.md properly and understand how auto memory, Memory Tool, context compaction, and subagent memory layers work together to eliminate the need to re-explain the same codebase details in every session.
tools🎯 methodology
AI Disruption of BigLaw Economic Model
Zack Shapiro · tweet · ~8 min
When AI can compress complex legal work from thousands of associate hours to partner-level work in hours, the fundamental economics of leveraged professional services collapse, creating opportunities for new service models that compete on individual capability rather than institutional scale.
concepts📤 share
OpenKB - Open Source Knowledge Base System
article · ~7 min
Persistent knowledge compilation is more efficient than real-time retrieval because it builds accumulated understanding that improves over time rather than starting from scratch on each query.
tools🎯 methodology📤 share
LLM Wiki vs RAG Knowledge Management
article · ~8 min
Choose LLM wiki for bounded, stable personal knowledge bases under 100 articles, and RAG for dynamic, large-scale enterprise systems - they solve different versions of the same knowledge access problem.
concepts🎯 methodology📤 share
AI Agent Workflows for 10x Engineering Productivity
Rhea Purohit · article · ~9 min
Build AI workflows where each task makes the next one easier by investing extra care in the planning phase before any code is written - create detailed specs and review them before implementation to avoid costly mistakes downstream.
concepts🎯 methodology📤 share
Processing failed
article · ~2 min
Review manually.
resources
AI-Native Company Operations and Workforce
Lenny Rachitsky · article · ~5 min
Companies can become AI-first by having leadership model AI usage, hosting internal prompt-sharing sessions, and designating AI operations specialists to help teams integrate AI tools effectively into their workflows.
concepts🎯 methodology📤 share
Claude Code Performance Issues and Fixes
article · ~8 min
When AI product performance degrades, investigate multiple potential causes simultaneously as seemingly broad issues may actually be several distinct problems affecting different user segments on different timelines.
tools🎯 methodology
/grill-with-docs: Enhanced AI Collaboration with Documentation
article · ~8 min
Combining AI questioning with real-time documentation prevents re-explaining domain concepts and builds progressively better shared language between humans and AI across sessions.
tools🎯 methodology📤 share
Real-time Search Quality Evaluation Systems
article · ~3 min
Static test data is insufficient for evaluating search quality - you need real-time evaluation against actual user conversations to catch knowledge base changes and evolving customer issues that would otherwise go undetected.
concepts🎯 methodology📤 share
Claude Agent SDK for Building AI Agents
article · ~8 min
Giving AI agents access to the same computer tools humans use (terminal, file system, etc.) unlocks more effective general-purpose agents that can handle complex, iterative tasks across diverse domains.
tools🎯 methodology📤 share
Agent Engineering Framework and Definition
Latent.Space · article · ~9 min
Since no one agrees on what constitutes an 'agent', focus on the six practical elements rather than debating definitions - this gives you a concrete framework for building and evaluating agentic systems.
concepts🎯 methodology📤 share
Forward Deployed Engineers in Enterprise AI
article · ~6 min
When evaluating AI vendor FDE services, focus on who pays the costs and whether the engagement builds internal capabilities - flat FDE effort across deployments signals dangerous vendor dependency rather than true capability transfer.
concepts🎯 methodology📤 share
Harness Engineering and Adversarial AI Architecture
Eric · article · ~8 min
For complex AI tasks, shift from perfecting prompts to designing adversarial agent architectures where a separate Evaluator agent provides external critique to drive iterative improvement and prevent generic outputs.
concepts🎯 methodology📤 share
AI Agent Memory Benchmarks and Architectures 2026
article · ~17 min
Memory is now a first-class architectural component with measurable performance gaps, enabling production-scale AI agents that maintain context and personalization across sessions rather than being stateless.
concepts🎯 methodology📤 share
AI-Native Business Model and Organizational Structure
Dan Shipper · article · ~13 min
AI enables lean, multifaceted businesses where employees can be generalists using AI-first workflows, allowing small teams to operate multiple business lines that compound off each other through a cycle of experimentation, documentation, building, and teaching.
concepts🎯 methodology📤 share
AI Agent Orchestration Patterns for Production
JobsByCulture · article · ~8 min
Only use multi-agent orchestration when you genuinely need it for context limits, specialization, or parallelism - otherwise stick with well-engineered single-agent systems that are simpler to build and debug.
concepts🎯 methodology📤 share
Anthropic Three-Agent AI Development Architecture
article · ~3 min
Separating the work-performing agent from the evaluation agent significantly improves output quality in long-running AI tasks, while structured handoffs prevent context amnesia that typically causes autonomous agents to fail.
concepts🎯 methodology📤 shareread
Agent-Native Software Architecture Paradigm
Dan Shipper · article · ~7 min
This architecture enables faster development and allows users to modify app behavior through natural language, democratizing software creation beyond traditional coding expertise.
concepts🎯 methodology📤 share
AI Agent Security and Prompt Injection Vulnerabilities
Airia Team · article · ~4 min
Secure agentic systems by mapping data access blast radius, implementing least privilege principles, and limiting agent permissions to only necessary data sources rather than broad organizational access.
concepts🎯 methodology📤 share
Claude Memory Architecture for Persistent Context
article · ~2 min
Build reliable coding agents by implementing structured memory layers that persist only relevant context, rather than carrying forward complete conversation history which causes context drift and failures.
concepts🎯 methodology📤 share
Generative UI for AI Agents
article · ~11 min
Moving beyond text-only chat interfaces to dynamic UI generation makes agent systems more transparent, trustworthy, and effective by exposing agent state and enabling structured interactions.
concepts🎯 methodology📤 shareread
AI Coding Loops vs Direct Prompting
Matt Van Horn · tweet · ~9 min
The future of AI-assisted coding isn't better prompts, but building automated systems that handle the prompting cycle, allowing engineers to work at a higher level of abstraction by writing the orchestration logic rather than the code itself.
concepts🎯 methodology📤 shareread
Anthropic Acquires Stainless SDK Platform
article · ~1 min
The acquisition signals that AI agent capability is fundamentally limited by connectivity infrastructure, making SDK and tooling quality critical for AI platform adoption.
tools🎯 methodology
Agent Literacy: Claude vs Codex Interface Philosophy
Nate B Jones · youtube · ~14 min
Focus on developing 'agent literacy' - the skill of directing agents with clear context, permissions, goals, and success criteria - rather than picking sides in tool debates.
concepts🎯 methodology📤 share
Forward Deployed Engineers in AI Companies
article · ~8 min
The surge in FDE hiring indicates AI companies are shifting focus from pure product development to enterprise deployment and integration, suggesting implementation challenges are a major bottleneck for AI adoption.
concepts🎯 methodology📤 share
Anthropic Claude Managed Agents for Business Automation
Corey Ganim · tweet · ~5 min
The barrier to starting an AI services business dropped from hiring developers to simply describing business workflows, opening opportunities for $1,500-5,000 setup fees plus recurring revenue.
tools🎯 methodology
Comparing gstack, Superpowers, and Compound Engineering Tools
Vox · tweet · ~7 min
Most developers install one AI coding tool thinking they're covered, but these tools solve different problems and work best when used together as complementary layers in your development process.
tools🎯 methodology
Enterprise AI Adoption: Four Stages Framework
ashu garg · tweet · ~8 min
Founders selling to enterprises should focus on moving clients from pilot sprawl to measurable outcomes by partnering as guides who can demonstrate clear ROI and business impact.
concepts🎯 methodology📤 share
PwC Claude Enterprise AI Implementation Strategy
article · ~7 min
Enterprise AI adoption succeeds when focused on end-to-end task completion in high-accuracy domains rather than just pilots, with the biggest gains coming from agentic systems that let experienced professionals operate at unprecedented scale.
concepts🎯 methodology
Compound Engineering 8-Step Framework Evolution
Kieran Klaassen · article · ~6 min
As AI becomes more capable at execution, human value shifts to the 'sandwich' approach - defining what's worth building at the start and ensuring the final product feels right at the end, while letting AI handle the technical implementation in between.
concepts🎯 methodology📤 share
Claude Code Agent View CLI Dashboard
https://www.facebook.com/testingcatalog · article · ~2 min
Agent View transforms Claude Code from a chat tool into an agent operations center, enabling developers to orchestrate multiple coding workflows simultaneously and scale their development processes.
tools🎯 methodology
Claude Platform Updates and Multi-Agent Features
Simon Willison · article · ~7 min
The 'advisor strategy' of using Opus to guide smaller models can achieve frontier-model quality at 5x lower cost, suggesting a practical approach to balance performance and economics in AI applications.
tools🎯 methodology📤 share
AI Context Persistence Problem in Development
Taelin · tweet · ~1 min
Domain-specific fine-tuning represents a major product opportunity since current context management solutions fail to preserve and retrieve specialized knowledge across AI sessions.
concepts🎯 methodology📤 share
Anthropic Claude Managed Agents Platform Launch
Dan Shipper, Marcus Moretti, and Katie Parrott · article · ~3 min
AI platforms are evolving from simple API endpoints to comprehensive hosted infrastructure, suggesting developers should consider managed solutions over building custom agent infrastructure.
tools🎯 methodology
AI Agent Loops vs Human-in-the-Loop
Greg Isenberg · youtube · ~13 min
Human-in-the-loop provides better control and cost-effectiveness for most developers, while fully autonomous AI loops may only be practical for those with unlimited access to AI models.
concepts🎯 methodology📤 shareread
Neofirms: AI-Era Professional Services Evolution
Ryan Daniels · tweet · ~7 min
Success in AI-era professional services requires sacrificing short-term revenue for long-term capability building, structuring firms as corporations rather than partnerships to enable sustained R&D investment in human-AI collaboration.
concepts🎯 methodology📤 share
KPMG-Anthropic Strategic AI Alliance for Enterprise
article · ~4 min
Enterprise AI adoption succeeds when AI is embedded directly into existing work platforms rather than separate tools, reducing friction from weeks of development to minutes for common tasks like building compliance agents.
concepts🎯 methodology
LLM Knowledge Bases for Business Intelligence
article · ~8 min
Companies can reduce the 9.3 hours workers spend searching for information weekly by letting LLMs automatically organize and synthesize internal documents into living knowledge systems that self-update and cross-reference content.
concepts🎯 methodology📤 share
2026 AI Software Architecture Predictions
Dan Shipper 📧 · tweet · ~2 min
As AI reduces software development costs, the bottleneck shifts from engineering capacity to design quality and user experience, creating new opportunities for designers and AI-native developers.
concepts🎯 methodology📤 share
Cortex methodology candidates→ /team/methodology
Auto-flagged as cortex-relevant. Drafts get composed when 3+ items cluster around a topic and pushed to core/methodology/_drafts/ for review.
LLM-Managed Personal Knowledge Base System
Andrej Karpathy · tweet · ~3 min
Instead of manually curating knowledge bases, let LLMs automatically compile and maintain wikis from raw sources, then query against them - this scales better than manual curation and creates a self-improving research system.
concepts🎯 methodology📤 share
Prompt Injection Vulnerabilities in AI Coding Assistants
article · ~23 min
Prompt injection should be treated as a first-class vulnerability requiring architectural-level security mitigations rather than simple filtering approaches, especially as AI agents gain more system-level privileges and tool access.
concepts🎯 methodology📤 shareread
Software 3.0 and Agentic Programming Evolution
article · ~8 min
Programmers are becoming orchestrators of agents rather than code writers, requiring a shift from line-by-line coding to high-level task delegation and context management.
concepts🎯 methodology📤 shareread
Project Glasswing AI Vulnerability Discovery
article · ~13 min
AI-powered vulnerability discovery is now limited by patching speed rather than finding speed, representing a fundamental shift in cybersecurity where verification and disclosure processes become the bottleneck.
tools🎯 methodology📤 share
AI Coding Agent Evaluation Skills Framework
Hamel Husain · article · ~3 min
Start with the eval-audit skill to diagnose your current evaluation setup, then use specific skills like error-analysis to categorize failures properly rather than lumping different error types into generic scores.
tools🎯 methodology📤 share
Experience Internalization for Continual Learning LLMs
article · ~19 min
For sustainable continual learning in LLMs, use principle-level experience abstraction with step-wise injection and off-policy context-distillation on high-quality teacher trajectories to avoid the capability degradation that occurs with iterative on-policy methods.
concepts🎯 methodologyread
Claude Managed Agents Launch
Lance Martin · tweet · ~5 min
Use managed agent infrastructure for production AI agents to avoid the overhead of maintaining custom harnesses and infrastructure while enabling long-running tasks that can execute over days or weeks.
tools🎯 methodology
AI Output Evolution: Text to Interactive Visual Media
Andrej Karpathy · tweet · ~2 min
Ask LLMs to format responses as HTML instead of markdown to leverage humans' superior visual processing capabilities and get more engaging, easier-to-consume outputs.
concepts🎯 methodology📤 share
Opus 4.7 Productivity Tips from Boris Cherny
article · ~3 min
Auto mode and proper verification patterns are crucial for running multiple Claude instances in parallel and ensuring reliable output from long-running tasks.
tools🎯 methodology
Multi-Agent AI Systems Architecture and Performance
article · ~18 min
Multi-agent systems excel by distributing work across separate context windows for parallel reasoning, with performance primarily driven by total token usage rather than individual agent intelligence.
concepts🎯 methodology📤 share
Context Engineering vs Prompt Engineering
article · ~16 min
Treat context as a finite resource with diminishing returns and actively curate what information gets included in each inference cycle, rather than just focusing on writing better prompts.
concepts🎯 methodology📤 share
How Coding Agents Work with LLMs
Simon Willison · article · ~6 min
Understanding that LLMs are stateless completion engines helps optimize coding agent interactions by leveraging token caching and avoiding modifications to earlier conversation content to control costs.
concepts🎯 methodology📤 share
Fine-tuning Agents with Reverse-Engineered Training Data
article · ~8 min
When building agents without interaction data, start with output artifacts users already produce and reverse-engineer the training data—this creates realistic training examples before you have production metrics.
concepts🎯 methodology📤 share
Authorization Propagation in Multi-Agent AI Systems
article · ~7 min
Identity governance must be treated as infrastructure in multi-agent systems - evaluated continuously and enforced at every interaction boundary before orchestration logic scales, as ordinary system behavior already produces authorization failures.
concepts🎯 methodology📤 share
Claude Opus 4.8 AI Model Release
article · ~8 min
Opus 4.8's enhanced judgment and reliability in agentic tasks makes it suitable for autonomous workflows where models need to work unattended and catch their own mistakes.
tools🎯 methodology
Claude Financial Services AI Agent Framework
article · ~7 min
Provides ready-to-deploy financial AI agents that can be customized for firm-specific workflows while maintaining human oversight requirements for compliance and regulatory approval.
tools🎯 methodology
LLM Council Multi-Model Query System
article · ~2 min
Combining multiple LLMs with cross-evaluation can provide more robust answers than single-model queries, and anonymizing responses during peer review prevents bias in the ranking process.
tools🎯 methodology
Trust Layer for AI-Generated Office Files
Nate · article · ~3 min
Build the truth layer first before the polished output - create an inventory of sources, map claims to evidence, and use a two-model review process to catch errors that look correct but are fundamentally wrong.
concepts🎯 methodology📤 share
Matt Pocock's Production Agent Skills Library
Yash Thakker · article · ~10 min
These skills represent battle-tested workflows from a practicing engineer, offering a blueprint for moving beyond experimental AI coding to production-ready development practices with proper planning and safety guardrails.
tools🎯 methodology
Claude for Legal Practice Workflows
Zack Shapiro · tweet · ~8 min
General-purpose AI tools like Claude can outperform specialized legal AI products because legal advantage comes from professional judgment in applying tools, not from having firm-specific templates or clause libraries.
tools🎯 methodology📤 share
Palantir's Forward Deployed Engineer Enterprise Model
MindStudio Team · article · ~8 min
Enterprise AI deployment requires embedding technical experts within client organizations because neither side alone has sufficient knowledge to successfully implement AI in production environments.
concepts🎯 methodology📤 share
LLM-as-a-Judge for Automated Model Evaluation
Karyna Naminas · article · ~7 min
LLM judges can replace expensive human evaluation for most AI output assessment tasks because RLHF-trained models have internalized human preferences and can recognize quality even when they can't perfectly generate it.
concepts🎯 methodology📤 share
LLM Judge Model Selection Framework 2026
NVJK Kartik · article · ~7 min
Choose LLM judges based on calibration against YOUR specific rubric rather than generic benchmarks, as judge model changes can silently break evaluation pipelines while maintaining misleading consistency scores.
tools🎯 methodology📤 share
Enterprise LLM Wiki Knowledge Management Pattern
article · ~15 min
Personal knowledge management patterns break at company scale not due to technical limitations but because they require dedicated human curation - enterprise versions must automate both ingestion and maintenance to succeed.
concepts🎯 methodology📤 share
LLM as Judge Pattern for Agent Safety
MindStudio Team · article · ~17 min
Using a second LLM to validate agent outputs catches contextual errors that static rules miss, making it essential for high-stakes workflows like automated emails, database updates, or financial transactions.
concepts🎯 methodology📤 share
Claude Code Memory System Architecture
orchestrator.dev · article · ~5 min
Configure CLAUDE.md properly and understand how auto memory, Memory Tool, context compaction, and subagent memory layers work together to eliminate the need to re-explain the same codebase details in every session.
tools🎯 methodology
OpenKB - Open Source Knowledge Base System
article · ~7 min
Persistent knowledge compilation is more efficient than real-time retrieval because it builds accumulated understanding that improves over time rather than starting from scratch on each query.
tools🎯 methodology📤 share
LLM Wiki vs RAG Knowledge Management
article · ~8 min
Choose LLM wiki for bounded, stable personal knowledge bases under 100 articles, and RAG for dynamic, large-scale enterprise systems - they solve different versions of the same knowledge access problem.
concepts🎯 methodology📤 share
AI Agent Workflows for 10x Engineering Productivity
Rhea Purohit · article · ~9 min
Build AI workflows where each task makes the next one easier by investing extra care in the planning phase before any code is written - create detailed specs and review them before implementation to avoid costly mistakes downstream.
concepts🎯 methodology📤 share
AI-Native Company Operations and Workforce
Lenny Rachitsky · article · ~5 min
Companies can become AI-first by having leadership model AI usage, hosting internal prompt-sharing sessions, and designating AI operations specialists to help teams integrate AI tools effectively into their workflows.
concepts🎯 methodology📤 share
Claude Code Performance Issues and Fixes
article · ~8 min
When AI product performance degrades, investigate multiple potential causes simultaneously as seemingly broad issues may actually be several distinct problems affecting different user segments on different timelines.
tools🎯 methodology
/grill-with-docs: Enhanced AI Collaboration with Documentation
article · ~8 min
Combining AI questioning with real-time documentation prevents re-explaining domain concepts and builds progressively better shared language between humans and AI across sessions.
tools🎯 methodology📤 share
Real-time Search Quality Evaluation Systems
article · ~3 min
Static test data is insufficient for evaluating search quality - you need real-time evaluation against actual user conversations to catch knowledge base changes and evolving customer issues that would otherwise go undetected.
concepts🎯 methodology📤 share
Claude Agent SDK for Building AI Agents
article · ~8 min
Giving AI agents access to the same computer tools humans use (terminal, file system, etc.) unlocks more effective general-purpose agents that can handle complex, iterative tasks across diverse domains.
tools🎯 methodology📤 share
Agent Engineering Framework and Definition
Latent.Space · article · ~9 min
Since no one agrees on what constitutes an 'agent', focus on the six practical elements rather than debating definitions - this gives you a concrete framework for building and evaluating agentic systems.
concepts🎯 methodology📤 share
Forward Deployed Engineers in Enterprise AI
article · ~6 min
When evaluating AI vendor FDE services, focus on who pays the costs and whether the engagement builds internal capabilities - flat FDE effort across deployments signals dangerous vendor dependency rather than true capability transfer.
concepts🎯 methodology📤 share
Harness Engineering and Adversarial AI Architecture
Eric · article · ~8 min
For complex AI tasks, shift from perfecting prompts to designing adversarial agent architectures where a separate Evaluator agent provides external critique to drive iterative improvement and prevent generic outputs.
concepts🎯 methodology📤 share
AI Agent Memory Benchmarks and Architectures 2026
article · ~17 min
Memory is now a first-class architectural component with measurable performance gaps, enabling production-scale AI agents that maintain context and personalization across sessions rather than being stateless.
concepts🎯 methodology📤 share
AI-Native Business Model and Organizational Structure
Dan Shipper · article · ~13 min
AI enables lean, multifaceted businesses where employees can be generalists using AI-first workflows, allowing small teams to operate multiple business lines that compound off each other through a cycle of experimentation, documentation, building, and teaching.
concepts🎯 methodology📤 share
AI Agent Orchestration Patterns for Production
JobsByCulture · article · ~8 min
Only use multi-agent orchestration when you genuinely need it for context limits, specialization, or parallelism - otherwise stick with well-engineered single-agent systems that are simpler to build and debug.
concepts🎯 methodology📤 share
Anthropic Three-Agent AI Development Architecture
article · ~3 min
Separating the work-performing agent from the evaluation agent significantly improves output quality in long-running AI tasks, while structured handoffs prevent context amnesia that typically causes autonomous agents to fail.
concepts🎯 methodology📤 shareread
Agent-Native Software Architecture Paradigm
Dan Shipper · article · ~7 min
This architecture enables faster development and allows users to modify app behavior through natural language, democratizing software creation beyond traditional coding expertise.
concepts🎯 methodology📤 share
AI Agent Security and Prompt Injection Vulnerabilities
Airia Team · article · ~4 min
Secure agentic systems by mapping data access blast radius, implementing least privilege principles, and limiting agent permissions to only necessary data sources rather than broad organizational access.
concepts🎯 methodology📤 share
Claude Memory Architecture for Persistent Context
article · ~2 min
Build reliable coding agents by implementing structured memory layers that persist only relevant context, rather than carrying forward complete conversation history which causes context drift and failures.
concepts🎯 methodology📤 share
Generative UI for AI Agents
article · ~11 min
Moving beyond text-only chat interfaces to dynamic UI generation makes agent systems more transparent, trustworthy, and effective by exposing agent state and enabling structured interactions.
concepts🎯 methodology📤 shareread
AI Coding Loops vs Direct Prompting
Matt Van Horn · tweet · ~9 min
The future of AI-assisted coding isn't better prompts, but building automated systems that handle the prompting cycle, allowing engineers to work at a higher level of abstraction by writing the orchestration logic rather than the code itself.
concepts🎯 methodology📤 shareread
Anthropic Acquires Stainless SDK Platform
article · ~1 min
The acquisition signals that AI agent capability is fundamentally limited by connectivity infrastructure, making SDK and tooling quality critical for AI platform adoption.
tools🎯 methodology
Agent Literacy: Claude vs Codex Interface Philosophy
Nate B Jones · youtube · ~14 min
Focus on developing 'agent literacy' - the skill of directing agents with clear context, permissions, goals, and success criteria - rather than picking sides in tool debates.
concepts🎯 methodology📤 share
Forward Deployed Engineers in AI Companies
article · ~8 min
The surge in FDE hiring indicates AI companies are shifting focus from pure product development to enterprise deployment and integration, suggesting implementation challenges are a major bottleneck for AI adoption.
concepts🎯 methodology📤 share
Anthropic Claude Managed Agents for Business Automation
Corey Ganim · tweet · ~5 min
The barrier to starting an AI services business dropped from hiring developers to simply describing business workflows, opening opportunities for $1,500-5,000 setup fees plus recurring revenue.
tools🎯 methodology
Comparing gstack, Superpowers, and Compound Engineering Tools
Vox · tweet · ~7 min
Most developers install one AI coding tool thinking they're covered, but these tools solve different problems and work best when used together as complementary layers in your development process.
tools🎯 methodology
Enterprise AI Adoption: Four Stages Framework
ashu garg · tweet · ~8 min
Founders selling to enterprises should focus on moving clients from pilot sprawl to measurable outcomes by partnering as guides who can demonstrate clear ROI and business impact.
concepts🎯 methodology📤 share
PwC Claude Enterprise AI Implementation Strategy
article · ~7 min
Enterprise AI adoption succeeds when focused on end-to-end task completion in high-accuracy domains rather than just pilots, with the biggest gains coming from agentic systems that let experienced professionals operate at unprecedented scale.
concepts🎯 methodology
Compound Engineering 8-Step Framework Evolution
Kieran Klaassen · article · ~6 min
As AI becomes more capable at execution, human value shifts to the 'sandwich' approach - defining what's worth building at the start and ensuring the final product feels right at the end, while letting AI handle the technical implementation in between.
concepts🎯 methodology📤 share
Claude Code Agent View CLI Dashboard
https://www.facebook.com/testingcatalog · article · ~2 min
Agent View transforms Claude Code from a chat tool into an agent operations center, enabling developers to orchestrate multiple coding workflows simultaneously and scale their development processes.
tools🎯 methodology
Claude Platform Updates and Multi-Agent Features
Simon Willison · article · ~7 min
The 'advisor strategy' of using Opus to guide smaller models can achieve frontier-model quality at 5x lower cost, suggesting a practical approach to balance performance and economics in AI applications.
tools🎯 methodology📤 share
AI Context Persistence Problem in Development
Taelin · tweet · ~1 min
Domain-specific fine-tuning represents a major product opportunity since current context management solutions fail to preserve and retrieve specialized knowledge across AI sessions.
concepts🎯 methodology📤 share
Anthropic Claude Managed Agents Platform Launch
Dan Shipper, Marcus Moretti, and Katie Parrott · article · ~3 min
AI platforms are evolving from simple API endpoints to comprehensive hosted infrastructure, suggesting developers should consider managed solutions over building custom agent infrastructure.
tools🎯 methodology
AI Agent Loops vs Human-in-the-Loop
Greg Isenberg · youtube · ~13 min
Human-in-the-loop provides better control and cost-effectiveness for most developers, while fully autonomous AI loops may only be practical for those with unlimited access to AI models.
concepts🎯 methodology📤 shareread
Neofirms: AI-Era Professional Services Evolution
Ryan Daniels · tweet · ~7 min
Success in AI-era professional services requires sacrificing short-term revenue for long-term capability building, structuring firms as corporations rather than partnerships to enable sustained R&D investment in human-AI collaboration.
concepts🎯 methodology📤 share
KPMG-Anthropic Strategic AI Alliance for Enterprise
article · ~4 min
Enterprise AI adoption succeeds when AI is embedded directly into existing work platforms rather than separate tools, reducing friction from weeks of development to minutes for common tasks like building compliance agents.
concepts🎯 methodology
LLM Knowledge Bases for Business Intelligence
article · ~8 min
Companies can reduce the 9.3 hours workers spend searching for information weekly by letting LLMs automatically organize and synthesize internal documents into living knowledge systems that self-update and cross-reference content.
concepts🎯 methodology📤 share
2026 AI Software Architecture Predictions
Dan Shipper 📧 · tweet · ~2 min
As AI reduces software development costs, the bottleneck shifts from engineering capacity to design quality and user experience, creating new opportunities for designers and AI-native developers.
concepts🎯 methodology📤 share