Skip to main content

2. Implementation Phase (~ 5–45 minutes)

Guided Coding workflow diagram highlighting the Implementation Phase. The three phases are Planning, Implementation, and Guiding, with arrows showing iteration back to Planning for large issues or back to Implementation for small issues.1. Planning Phase2. Implementation Phase3. Guiding Phase✓ PR

Hand the coding agent a single plan and ask it to implement:

"With you being an expert .NET developer, can you please implement the following plan for me?"

Then point to the corresponding markdown file. That's it. The agent goes off and works, typically for 5 to 45 minutes depending on the task. I let it run in auto-continue mode without supervising.

The key to a successful implementation phase is having good feedback loops in place:

  • Compilers / transpilers catch invalid syntax
  • Static code analysis tools catch potential issues in the codebase
  • Automated tests validate functional correctness
  • Linters enforce code style
  • Automated benchmarks verify performance
  • Security tools like trivy detect vulnerabilities

These feedback loops let the agent work longer autonomously. When the agent writes code, runs the tests, sees failures, and fixes them — all without your intervention — that's the power of this setup. Modern models execute tests by themselves; you don't have to instruct them to do so. You just need an acceptance criterion that says automated tests should be written.

I once asked an agent to optimize memory allocations in a hot path in Light.PortableResults. It wrote a benchmark, implemented a stack-allocated optimization for cases with fewer than 10 errors, ran the benchmark, and reported: 25% faster runtime and 80% less memory allocated. When I then asked it to eliminate one last remaining heap allocation, it tried, ran the benchmarks again, and saw that performance actually got worse. It rolled back its own changes. These feedback loops are invaluable.

This image shows a Coding Agent conversation where the agent executes automated benchmarks to measure its code optimization