comments (1)

  • Cool project. A team at work was building something similar to internal use.

    I'm curious how this compares to just using Claude Code directly and giving it a dump of the agent traces? It seems like Claude could probably do some of the same diagnostics / trace grouping to identify failure patterns. Why use a custom harness?

    funfunfunction