Outer Developer Loop (weeks/months) Flashcards

(122 cards)

2
Q

What is the Outer Developer Loop characterized by?

A

Moves slowly but powerfully: weeks to months, sometimes longer

It focuses on long-term changes to tools, architecture, policies, and culture.

3
Q

What is the time scale for the outer loop?

A
  • Weeks
  • Months (sometimes quarters)

It contrasts with the inner loop (seconds to minutes) and the middle loop (hours to days).

4
Q

What are some typical outer-loop questions?

A
  • Should we even use agents for this class of problems?
  • What should our default dev+agent workflow look like?
  • Which tools and platforms are ‘blessed’?
  • What are our safety and security boundaries?
  • How do we measure success and failure over time?
  • How do we help people adapt?

These questions guide the overall direction and governance of development practices.

5
Q

What are the steps in the outer-loop cycle?

A
  • Observe
  • Decide
  • Implement
  • Socialize & Educate
  • Measure & Iterate

This cycle helps in making informed decisions based on gathered data.

6
Q

What is a common outer-loop problem related to fragmentation?

A

Inconsistency in structuring prompts, logging agent actions, and granting permissions

This leads to difficulties in sharing knowledge and onboarding new team members.

7
Q

What does the outer loop do regarding shadow IT and risky agents?

A

Defines governance for agent capabilities and protects sensitive systems

It ensures a central view of agent access and actions.

8
Q

What is the outer loop’s role in addressing vendor lock-in and model drift?

A

Encourages portable patterns and plans for multi-model/multi-vendor scenarios

This helps mitigate risks associated with dependency on a single vendor.

9
Q

What are the standard repo scaffolds for agent use?

A
  • AGENTS.md – guidelines for agent operation
  • prompts/ – versioned templates for workflows
  • WORKLOG.md – logs of sessions and decisions

These scaffolds help maintain consistency across repositories.
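As a sketch of how this scaffold could be checked automatically, the snippet below verifies that a repo root contains the files named on this card. The `missing_scaffold` helper and the choice of which entries are required are illustrative assumptions, not part of any real tool.

```python
from pathlib import Path

# Scaffold entries from the card; WORKLOG.md is treated as optional here
# (an assumption), so only the first two are required.
REQUIRED = ["AGENTS.md", "prompts"]

def missing_scaffold(repo: Path) -> list[str]:
    """Return the required scaffold entries absent from the repo root."""
    return [name for name in REQUIRED if not (repo / name).exists()]
```

A check like this could run in CI to flag repos that drift from the standard layout.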

10
Q

What is the purpose of having a platform/guild in the outer loop?

A

To evolve agent patterns, maintain shared tools, and curate training material

This ensures ownership and accountability in managing agent practices.

11
Q

What does the standard permission model define?

A
  • Green – agent can read/write
  • Yellow – agent proposes; humans apply
  • Red – agent read-only (if at all)

This model helps manage risk and access control across projects.
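The green/yellow/red model above can be encoded as data so tooling can consult it before an agent acts. The zone-to-path mapping below is a made-up example; a real repo would define its own paths.

```python
# Permission semantics from the card: green = agent writes freely,
# yellow = agent proposes and a human applies, red = read-only at most.
ZONES = {
    "green":  {"agent_writes": True,  "human_applies": False},
    "yellow": {"agent_writes": False, "human_applies": True},
    "red":    {"agent_writes": False, "human_applies": False},
}

# Hypothetical path-to-zone assignments for illustration only.
PATH_ZONES = {
    "docs/": "green",
    "ci/": "yellow",
    "prod/secrets/": "red",
}

def agent_may_write(path: str) -> bool:
    """True only when the path falls in a green zone; unknown paths are denied."""
    for prefix, zone in PATH_ZONES.items():
        if path.startswith(prefix):
            return ZONES[zone]["agent_writes"]
    return False
```

Defaulting unknown paths to "no write" mirrors the deck's bias toward small blast radius.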

12
Q

What metrics should the outer loop focus on beyond cool demos?

A
  • Time from ticket open → merge
  • Defect/incident rates before vs after agent usage
  • AI cost per feature or per team
  • Developer satisfaction

These metrics provide insights into the effectiveness of agent integration.
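One of these metrics, time from ticket open to merge, falls out of plain timestamp arithmetic. The ticket records below are invented for illustration.

```python
from datetime import datetime
from statistics import median

def hours_open_to_merge(opened: str, merged: str) -> float:
    """Hours between a ticket's open and merge timestamps (ISO-like strings)."""
    fmt = "%Y-%m-%dT%H:%M"
    delta = datetime.strptime(merged, fmt) - datetime.strptime(opened, fmt)
    return delta.total_seconds() / 3600

# Made-up (opened, merged) pairs standing in for real ticket data.
tickets = [
    ("2024-05-01T09:00", "2024-05-02T09:00"),  # 24 hours
    ("2024-05-03T10:00", "2024-05-03T16:00"),  # 6 hours
    ("2024-05-04T08:00", "2024-05-05T20:00"),  # 36 hours
]
median_hours = median(hours_open_to_merge(o, m) for o, m in tickets)
```

Tracking the median (rather than the mean) keeps one pathological ticket from skewing the trend line.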

13
Q

What is the outer loop’s job regarding culture lag?

A

Align culture and incentives with the reality of agents

This includes recognizing prompt design and agent workflows as real engineering work.

14
Q

What is the outer loop primarily responsible for?

A

Setting the rules of the game and the shape of the playing field

It influences the overall development environment and practices.

15
Q

What are the Outer Developer Loop realities focused on?

A

Ecosystem: architecture, governance, and economics of agents in your stack

This level involves understanding the broader context in which development occurs.

16
Q

What is a critical aspect of permission models and governance in the Outer Developer Loop?

A

Defining what agents are allowed to touch and who approves expansions

Clear boundaries are essential to avoid shutdowns by SRE/security or developer hesitance.

17
Q

What can happen without clear boundaries in the Outer Developer Loop?

A
  • SRE/security shuts you down
  • Developers are afraid to run agents on important systems

This can lead to inefficiencies and hinder development processes.

18
Q

What does the Outer Loop define regarding zones of trust?

A
  • Scratch repos
  • Staging DBs
  • Production read-only

These zones help manage where agents can operate safely.

19
Q

In the Outer Developer Loop, how do you design human approval for new capabilities?

A

By establishing processes for sign-offs, such as allowing an agent to edit CI configs

This ensures accountability and security in agent operations.

20
Q

In economics and sustainability, the question shifts from “is this cool?” to what?

A

“is this cost-effective compared to traditional approaches?”

This shift indicates a focus on practicality and efficiency in decision-making.

21
Q

What slows down the workflow in economics and sustainability?

A

A workflow that feels magical individually may be unsustainable at scale

This highlights the challenges of scaling innovative solutions.

22
Q

What are the outer loop decisions in the context of economics and sustainability?

A
  • Invest in prompt / workflow refinement
  • Invest in smaller specialized models
  • Invest in more powerful general-purpose models

These decisions impact the overall strategy and resource allocation.

23
Q

How do you begin to treat agent usage in economics and sustainability?

A

Like any other infra: budget, alerts, optimization

This approach emphasizes the need for structured management of resources.

24
Q

What are the architectural choices that impact the inner and middle loop?

A
  • Orchestrator
  • Memory system
  • Tool registry
  • Repositories

These decisions shape the development process and can lead to costly rewrites if poor choices are made.

25
Q

A poor early choice in architectural decisions can lead to _______ later.

A

costly rewrites

This can occur due to wrong orchestrator selection, unclear memory models, or lack of conventions.

26
Q

What does the comprehensive view include regarding architectural decisions?

A
  • Beads
  • AGENTS.md
  • LangGraph
  • Temporal/“not Temporal” decisions

These elements are crucial for making informed architectural choices.

27
Q

The outer loop in architectural choices is focused on choosing patterns that make what simpler?

A

The middle/inner loops

The goal is to avoid adding complexity to the development process.

28
Q

What is the system-level behavior for idle time that is desired?

A
  • Triaging backlog issues
  • Running periodic health checks
  • Fixing low-risk lints/docs

This behavior aims to improve system efficiency by proactively addressing issues rather than waiting for demand.

29
Q

What slows down the system when agents are always 'on demand'?

A

Lack of proactive improvement

Without proactive measures, the system cannot effectively enhance its performance.

30
Q

What is the outer loop responsible for in system management?

A

Deciding on proactive work delegation

The outer loop helps determine which tasks can be safely automated and which require human intervention.

31
Q

What are the guardrails designed for in system orchestration?

A
  • Tasks that are safe to auto-launch
  • Tasks that require a human trigger

Guardrails help maintain control over automated processes and ensure safety.

32
Q

What is the reality regarding human processes and culture in teams?

A

Optimized for 'humans writing code'

This optimization does not account for the orchestration of agents, leading to inefficiencies.

33
Q

What type of friction is caused by existing processes in teams?

A
  • PR templates assuming manual authorship
  • Review processes not acknowledging AI-generated changes
  • Performance metrics undervaluing orchestration skills

These frictions hinder the integration of AI and automation in development workflows.

34
Q

What is the purpose of defining agents in your stack?

A

Productivity, quality, learning, etc.

Agents exist to enhance various aspects of the development and operational processes.
35
Q

What are the non-goals of agents?

A
  • Direct prod DB writes
  • Handling secrets

Agents should not be used for sensitive operations that could compromise security.

36
Q

Define the risk appetite in the context of agents.

A

Roughly, what level of breakage/cost is acceptable in green zones

This helps in making informed decisions regarding agent operations.

37
Q

What is the first step in permissions & governance?

A

Define zones of trust (and keep them stable)

Establishing clear zones helps manage agent capabilities effectively.

38
Q

What are the three zones of trust?

A
  • Green
  • Yellow
  • Red

Each zone has specific permissions and controls for agent operations.

39
Q

In the Green zone, what can an agent do?

A
  • Read/write feature branches
  • Read/write test data
  • Read/write docs
  • Read/write staging-only configs

This zone allows agents to operate freely within non-production environments.

40
Q

In the Yellow zone, what is the agent's capability?

A

Agent proposes, human applies

This zone requires human oversight for changes to ensure safety.

41
Q

In the Red zone, what is the agent's access level?

A

Read-only (if at all)

This zone is highly restricted to protect sensitive information.

42
Q

What is required for any requested change to a zone?

A
  • A short justification
  • A human approver

This process ensures accountability and risk management.

43
Q

What should be logged in AGENTS_GOVERNANCE.md?

A
  • Date
  • Change
  • Rationale
  • Owner

Logging changes provides a clear history of governance decisions.

44
Q

How can you bake zones into the system?

A
  • AGENTS.md in each repo
  • Standard prompts
  • CLI tooling

These methods help enforce the defined zones consistently.

45
Q

How does defining zones avoid pain in agent operations?

A

Prevents ambiguity about agent capabilities

Clear definitions help avoid security blocks and encourage developer use.

46
Q

What do SRE/security teams see through the defined process?

A

A clear process for expanding agent capabilities

This transparency helps in making informed decisions rather than outright refusals.
47
Q

What is the purpose of active controls in establishing an economic model?

A
  • Pull basic metrics
  • Classify workflows
  • Decide on model usage
  • Set policies

Active controls help in monitoring and managing AI spending effectively.

48
Q

List the categories for classifying workflows based on value.

A
  • High-value
  • Medium-value
  • Low-value

This classification helps determine the appropriate model and spending for different tasks.

49
Q

What are examples of high-value workflows?

A
  • Critical refactors
  • Complex debugging
  • Deep design work

High-value workflows justify more spending on powerful models.

50
Q

What should be done if a workflow costs more than $N/month?

A

Review prompts and batching

This policy helps in managing costs effectively.

51
Q

What is the recommended default model for code refactors?

A

Model X

Upgrade to Model Y only on request.

52
Q

What is the purpose of adding a cost awareness section to AGENTS_GOVERNANCE.md?

A
  • Identify workloads that must stay cheap
  • Identify workloads allowed to be premium

This section helps in maintaining financial sustainability.

53
Q

True or false: The goal of passive scaffolding is to avoid financial unsustainability in workflows.

A

TRUE

It encourages questioning the necessity of using more expensive models.

54
Q

What should be included in prompt templates for model selection?

A

Model selection hints

This helps in defaulting to cheaper models unless a more powerful model is necessary.

55
Q

What metrics should be pulled regarding AI spend?

A
  • Total AI spend by project/team
  • Spend by model tier
  • Spend by use case
  • Failed calls due to rate limits/quotas

These metrics provide insights into spending patterns and areas for improvement.
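These metrics fall out of simple aggregation over per-call records, assuming your provider exports a cost and status for each call. The record shape and values below are assumptions for illustration.

```python
from collections import defaultdict

# Made-up per-call records: team, model tier, cost in dollars, success flag.
calls = [
    {"team": "payments", "tier": "small", "cost": 0.02, "ok": True},
    {"team": "payments", "tier": "large", "cost": 1.50, "ok": True},
    {"team": "search",   "tier": "large", "cost": 2.00, "ok": False},  # rate-limited
]

# Spend by model tier, one of the metrics named on this card.
spend_by_tier = defaultdict(float)
for c in calls:
    spend_by_tier[c["tier"]] += c["cost"]

# Failed calls (rate limits/quotas), another metric from the card.
failed_calls = sum(1 for c in calls if not c["ok"])
```

Grouping by team or use case is the same loop with a different key.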
56
Q

What is the purpose of maintaining a “Blessed Patterns & Components” catalog?

A

To review and standardize components for new projects

This catalog helps in simplifying processes and avoiding unnecessary lock-in.

57
Q

What should be reviewed every quarter or after major changes?

A
  • Current stack for agents
  • Orchestrator
  • Memory/context store
  • Tool registry pattern
  • Repo conventions
  • Integration points

Regular reviews help identify friction points and opportunities for simplification.

58
Q

What questions should be asked for each component during the review?

A
  • Does this simplify inner/middle loops, or add friction?
  • Are we locked in more than we need to be?
  • Are there obvious, simpler alternatives?

These questions guide decision-making regarding component usage.

59
Q

What decisions should be made during the component review?

A
  • Components to standardize
  • Components to sunset or fence off
  • Tiny experiments to run before the next review

These decisions help streamline future projects and reduce complexity.

60
Q

What is the purpose of passive scaffolding?

A

To provide defaults for new projects and facilitate discussions about alternatives

This approach ensures consistency and reduces the need to rediscover workable patterns.

61
Q

What should be abstracted behind small libraries or wrapper APIs?

A

Vendor-specific calls

This abstraction allows for easier swapping of providers without requiring a full rewrite.

62
Q

How does avoiding hard-wiring every inner loop to one orchestration library help?

A

Prevents becoming overly dependent on a single library

This strategy mitigates risks associated with vendor lock-in.

63
Q

What is the benefit of starting new projects on established patterns?

A

They begin on rails that are known to be decent

This reduces the time and effort needed to establish workable patterns.

64
Q

Define a “Safe Tasks for Idle Time” catalog.

A

A catalog that lists tasks that are:
  • Low-risk
  • Beneficial if done gradually
  • Safe to revert

Examples include linting, doc updates, generating test skeletons, triaging backlog, and running health checks.

65
Q

What are examples of safe tasks for idle time?

A
  • Linting/formatting
  • Doc updates from code comments
  • Generating or refreshing test skeletons
  • Triaging backlog
  • Running non-destructive health checks

These tasks are designed to be low-risk and beneficial when done gradually.

66
Q

For each safe task, specify what must be included in the catalog.

A
  • Applicable systems/repos
  • Frequency of execution (e.g., nightly, weekly)
  • Proof of work (PRs, reports, logs)

This ensures clarity on how and when tasks are performed.
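The catalog fields just listed (systems/repos, frequency, proof of work) can be kept as structured data so a scheduler can query them. The tasks, repo names, and schedules below are placeholders, not a real catalog.

```python
# Hypothetical safe-task catalog entries with the fields from the card.
SAFE_TASKS = [
    {"task": "linting", "repos": ["repo-a", "repo-b"],
     "frequency": "nightly", "proof": "PR"},
    {"task": "doc refresh", "repos": ["repo-a"],
     "frequency": "weekly", "proof": "PR"},
    {"task": "health check", "repos": ["repo-b"],
     "frequency": "nightly", "proof": "report"},
]

def tasks_for(repo: str, frequency: str) -> list[str]:
    """Tasks approved for this repo at this cadence."""
    return [t["task"] for t in SAFE_TASKS
            if repo in t["repos"] and t["frequency"] == frequency]
```

An idle agent would only pick work returned by a query like this, never improvise its own.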
67
Q

Decide where automation is allowed regarding idle tasks.

A
  • Agents may automatically open PRs for lint-only changes
  • Agents may never auto-merge; human review required

This establishes clear boundaries for automated actions.

68
Q

What are the basic guardrails for passive scaffolding?

A
  • Idle agents operate only on green-zone resources
  • They write to branches or open PRs, never direct to main

This helps maintain control over code changes.

69
Q

How does this approach avoid pain?

A
  • Continuous incremental improvements without rogue agents
  • Conscious decision-making on safe background tasks

This prevents unexpected issues from maintenance agents affecting critical systems.

70
Q

What should be updated in PR templates regarding AI assistance?

A
  • Ask if AI assistance was used
  • Require a brief note if large chunks are agent-generated
  • Encourage attaching agent-generated logs or summaries

These updates help acknowledge the role of AI in contributions.

71
Q

In code review guidelines, how should AI-generated code be treated?

A
  • Review with the same or more scrutiny as human-written code
  • Spot-check for hallucinated abstractions or invented APIs
  • Look for consistency with established patterns

This ensures quality and reliability in code contributions.

72
Q

What should postmortems include regarding incident response?

A
  • Was an agent involved?
  • Did our guardrails fail, or were they missing?

This helps in understanding the role of AI agents in incidents.

73
Q

What are some recognized contributions in performance & growth expectations?

A
  • Prompt design
  • Agent orchestration
  • Guardrail design
  • AGENTS.md stewardship

These elements are considered legitimate engineering contributions.

74
Q

What should be included in internal playbooks regarding agents?

A
  • How we use agents for feature work
  • How to safely do refactors with agents
  • How to document AI use in PRs

These playbooks guide safe and effective use of AI agents.

75
Q

What is the purpose of providing example repos?

A
  • Good AGENTS.md
  • Sensible prompts
  • Annotated PRs containing agent-generated code

Example repositories serve as models for best practices in AI integration.

76
Q

How does acknowledging AI contributions help avoid pain in the development process?

A
  • Prevents invisible AI contributions
  • Engineers feel mastering agent workflows is part of the job
  • Reduces cultural friction with explicit expectations

Clear acknowledgment fosters a better working environment.

77
Q

What is the purpose of conducting a short outer-loop retro every N weeks/months?

A
  • Identify time/effort savings
  • Recognize incidents or extra work
  • Monitor unexpected cost spikes or rate limits
  • Share good patterns created by teams

This process helps in continuously improving operations and governance.

78
Q

What should findings from the outer-loop retro be turned into?

A
  • Updates to AGENTS_GOVERNANCE.md
  • Updates to PLATFORM_PATTERNS.md
  • New or refined prompt templates
  • Adjusted zones of trust or idle-task rules
  • Training or documentation

These updates help in refining processes and improving efficiency.

79
Q

What is maintained at the top of AGENTS_GOVERNANCE.md?

A

A simple 'Changelog for Agent Policies'

This changelog includes date, change, rationale, and a link to retro notes.

80
Q

Fill in the blank: The outer-loop retro helps turn each quarter’s pain into next quarter’s _______.

A

guardrail

This concept emphasizes learning from past experiences to improve future operations.
81
Q

What does the green status indicate in permissions and governance?

A

Agent can read/write

Includes feature branches, test data & fixtures, docs and comments, staging-only configs.

82
Q

What does the yellow status indicate in permissions and governance?

A

Agent proposes, you apply

Includes CI/CD pipelines, Terraform/infra code, shared libraries, and database migrations.

83
Q

What does the red status indicate in permissions and governance?

A

Read-only or off-limits

Includes production DBs, secrets, security policies, and PII/sensitive data.

84
Q

Where should the permissions and governance policy be documented?

A

AGENTS_GOVERNANCE.md or at the top of AGENTS.md

Treat it as law and do not rely on memory.

85
Q

What is the purpose of a capability registry in permissions and governance?

A

To keep track of agent capabilities

Helps avoid 'agent creep' into dangerous territory.

86
Q

What should you consider when allowing agents to do something new?

A
  • What could go wrong?
  • How will I notice if it does?
  • How would I roll back?

This strategy helps manage risks associated with agent permissions.

87
Q

List the agent capabilities that may be allowed.

A
  • Edit code in feature branches of repo A
  • Generate tests
  • Update docs

These capabilities are documented in AGENTS_GOVERNANCE.md.

88
Q

What actions are agents not allowed to perform?

A
  • Direct writes to production systems
  • Read customer PII

These restrictions help maintain security and data integrity.

89
Q

True or false: The yellow status allows agents to directly write to production systems.

A

FALSE

Yellow status means agents can propose changes, but not apply them directly.

90
Q

What is the first principle of economics regarding models?

A

Treat models like infrastructure, not magic

This emphasizes the importance of budgeting and evaluating the cost-effectiveness of workflows.

91
Q

What should you set for each project or month according to the economic strategy?

A

Rough budgets

Example: “This project gets about $X/month of AI spend.”

92
Q

What types of tasks should be allowed to be ‘expensive’?

A
  • Bulk refactors
  • Complex tasks

Daily small edits should be kept cheap.

93
Q

At the end of each month/phase, what two questions should you ask regarding workflows?

A
  • Which workflows were worth the cost?
  • Which felt overkill for what they delivered?

This helps in adjusting habits and optimizing costs.

94
Q

How should you choose models according to the economic strategy?

A

By workflow, not by habit

Avoid defaulting to always using the biggest model.

95
Q

What type of model should be used for boilerplate tasks?

A

Cheap/small model

This includes tasks like doc generation and format conversions.
96
Q

What type of model is recommended for cross-service architecture questions?

A

Big/expensive model

This is suitable for complex tasks like gnarly debugging.

97
Q

What strategy should be included in your prompt templates regarding model usage?

A

Use model X by default; upgrade to Y only if you’re doing deep reasoning or stuck

This encourages efficient model selection.

98
Q

What should you periodically do to expensive workflows?

A

Kill or refactor them

Regularly assess high-usage workflows for potential improvements.

99
Q

What are some strategies to consider for improving high-usage workflows?

A
  • Caching
  • Batching
  • Partially doing the work with scripts or grep

Look for simpler prompts or patterns that achieve the same outcome.

100
Q

What recurring task should you schedule to improve workflow efficiency?

A

Kill or shrink one wasteful workflow

This task can lead to quick payoffs in efficiency.

101
Q

What is the minimal structure recommended for all agent-aware repositories?

A
  • AGENTS.md
  • prompts/
  • WORKLOG.md (optional)

Standardizing this structure allows for easier reuse and migration across projects.

102
Q

What should you do with vendor-specific code?

A

Wrap it behind:
  • small helper functions
  • a simple client module
  • CLI wrappers

This approach helps avoid scattering raw provider-specific code throughout the project.

103
Q

True or false: It is advisable to build a huge agent platform right away.

A

FALSE

Start with simple scripts and only add complexity when specific bottlenecks are identified.

104
Q

What should you start with before adding complex features like multi-agent graphs?

A
  • Simple scripts
  • Prompt templates
  • Logs
  • Lightweight orchestrator (if helpful)

This strategy emphasizes gradual development based on identified needs.

105
Q

What is the key question to ask before adding new architectural pieces?

A

What concrete bottleneck does this new architectural piece solve?

If you can't answer this, it's better not to add it yet.

106
Q

What is the purpose of curating an “auto-safe” task list?

A

To explicitly decide which kinds of tasks agents can do without supervision

This list helps maintain control over automated tasks and ensures safety.

107
Q

Give examples of auto-safe tasks.

A
  • Linting/formatting
  • Regenerating docs from comments
  • Adding missing docstrings
  • Running read-only health checks and producing reports
  • Triaging issues (adding labels, grouping, summarizing)

These tasks can be performed by agents without direct oversight.

108
Q

What are examples of tasks that are not auto-safe?

A
  • Changing business logic
  • Modifying schema/migrations
  • Touching production configs

These tasks require careful oversight due to their potential impact.

109
Q

What is the strategy for managing auto-safe tasks?

A

Put an “Auto-safe tasks” section in AGENTS_GOVERNANCE.md

This ensures that idle/autonomous agents do not perform tasks outside the approved list.

110
Q

What is the rule regarding how idle/auto agents should handle work?

A
  • Work in a branch
  • Open a PR
  • Clearly label it as “agent-generated maintenance”

This process prevents direct writes to main or production configurations.

111
Q

True or false: Idle or auto agents can write directly to main or production config.

A

FALSE

The personal rule states that no agent should write directly to these configurations to prevent damage.

112
Q

What is the purpose of capping how much auto-change can land per cycle?

A

To avoid “death by a thousand tiny PRs”

This helps manage the volume and impact of automated changes.

113
Q

What limitations can be set on auto PRs?

A
  • Frequency of opening (e.g., nightly/weekly)
  • Size limits (e.g., <= N files, <= X LOC)

These limitations make it easier to manage and revert changes if necessary.
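A minimal sketch of enforcing the size limits before an auto PR is opened; the threshold values stand in for the card's N files and X LOC and are not prescriptive.

```python
# Placeholder thresholds for the card's "<= N files, <= X LOC" limits.
MAX_FILES = 10
MAX_LOC = 200

def within_auto_pr_limits(files_changed: int, lines_changed: int) -> bool:
    """True when an agent-generated change is small enough to open as an auto PR."""
    return files_changed <= MAX_FILES and lines_changed <= MAX_LOC
```

An idle agent would call this before opening a PR and split or drop any change that exceeds the cap.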
114
What should you do if you find auto-changes **annoying or noisy**?
Turn them off first, then refine ## Footnote This approach prevents enduring chaos due to sunk-cost guilt.
115
What should you note in **PR descriptions** when agents are used?
Indicate the sections generated with agent and the prompt used ## Footnote Example: “Sections A & B generated with agent, prompt in prompts/refactor-v2.md.”
116
How should you review **AI-generated changes**?
As if they came from a junior dev ## Footnote Check APIs, hidden assumptions, and invented abstractions.
117
What strategy can be added to your **PR template**?
Add a checkbox for AI assistance used ## Footnote This encourages careful consideration of AI-generated changes.
118
What questions should be asked during **retros and incident reviews**?
* Did an agent contribute to this problem? * Did our guardrails fail, or were they missing? ## Footnote These questions help identify the role of AI in incidents.
119
What should you do if an **agent contributed** to a problem?
* Add/adjust a rule in AGENTS_GOVERNANCE.md * Update a prompt template * Tighten green/yellow/red for that area ## Footnote This approach helps improve guardrails.
120
What should you track and value as **real engineering work**?
* Improved prompt templates * Better AGENTS.md * Safer workflows * Reusable patterns developed ## Footnote Recognizing these contributions enhances the value of AI involvement.
121
What should you write at the end of a month/quarter for **AI/agent highlights**?
* Patterns discovered or improved * Guardrails added * Workflows that are now smoother or safer ## Footnote This keeps you in architect mode, focusing on improvements.
122
Four rules of thumb governing your decisions in the outer loop
1) Start in the smallest blast radius. * New workflows live in toy projects / side repos first, then staging, then real systems. 2) Prefer reversible decisions. * New config file? Good....Massive cross-repo rewrite tied to one tool? Bad. 3) Always know how to kill it. Any new autonomous/idle behavior should have: * a single flag/setting to turn off, * a clear place to revoke its permissions. 4) If you can’t explain it, don’t approve it. If you don’t understand: * what an agent is allowed to do, * where it runs, * how it’s observed, …it’s not ready to exist in your outer loop.