Q&A: using AI tools in code review without losing judgment
Questions we kept getting about where AI assistants fit into review, and how we avoid outsourcing judgment.
Q&A
Are we supposed to use AI tools in code review now?
They’re available; they’re not mandatory.
We treat AI assistants as tools for:
- generating alternative explanations
- pointing out potential edge cases
- drafting comments or suggestions
They are not:
- the final word on correctness
- a replacement for understanding the change
What do we look for in AI-assisted review that we didn’t before?
We pay attention to:
- whether reviewers can still explain the change in their own words
- whether important decisions (interfaces, data flows, security behavior) are understood by humans
If a reviewer can only say "the assistant didn’t see a problem," that’s a smell.
How do we keep AI suggestions from overwhelming the review?
We scope their use:
- ask specific questions ("What are edge cases for this input?"), not "Is this code good?"
- limit suggestions to particular files or functions (see the sketch after this answer)
- ignore style-only suggestions when they conflict with our existing conventions
Reviewers remain responsible for deciding which suggestions matter.
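To make "ask specific questions, scoped to particular files" concrete, here is a minimal sketch of what that might look like, assuming git-based changes. The branch name, file path, and `ask_assistant` function are placeholders for illustration, not part of our actual tooling; `ask_assistant` stands in for whatever approved integration is in use.

```python
"""Sketch: ask an assistant a specific, bounded question about a scoped diff."""
import subprocess


def scoped_diff(base: str, head: str, paths: list[str]) -> str:
    """Return the diff between two revisions, restricted to the given paths."""
    return subprocess.run(
        ["git", "diff", f"{base}..{head}", "--", *paths],
        capture_output=True, text=True, check=True,
    ).stdout


def ask_assistant(prompt: str) -> str:
    """Placeholder (hypothetical) for the team's approved assistant integration."""
    raise NotImplementedError("wire this to your approved assistant integration")


if __name__ == "__main__":
    # A specific, bounded question about one module, rather than "is this code good?"
    # Branch and path are invented for illustration.
    diff = scoped_diff("main", "feature/parser-refactor", ["src/parser.py"])
    question = (
        "What edge cases in the input handling does this diff not cover?\n\n" + diff
    )
    print(ask_assistant(question))
```

The point of the sketch is the shape of the question: one module, one concrete concern, with the reviewer still deciding what to do with the answer.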
Can we rely on AI tools to catch security or performance bugs?
We can use them to surface ideas—but we don’t rely on them.
Security reviews and performance-sensitive changes:
- still follow our normal processes
- may use AI tools as an extra pair of eyes, not a gatekeeper
We document real issues they help us catch, but we don’t treat that as a guarantee for future changes.
How do we avoid leaking sensitive data when using these tools?
We:
- use tools that respect our data-handling requirements
- avoid pasting secrets, production-only data, or user-identifying information into prompts (a minimal redaction sketch follows this answer)
- prefer integrations that keep analysis within our environment when possible
We treat prompts like logs: they may be stored or inspected, so we don’t put anything in them we wouldn’t put in a ticket.
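As one illustration of the "don't paste secrets or user data into prompts" rule, here is a minimal redaction sketch. The patterns and the example string are assumptions made up for illustration; they are not exhaustive and are not a substitute for the data-handling requirements mentioned above.

```python
"""Sketch: strip obvious secrets and user identifiers before text reaches a prompt."""
import re

# Illustrative patterns only: AWS-style key IDs, long hex tokens, email addresses.
_REDACTIONS = [
    (re.compile(r"AKIA[0-9A-Z]{16}"), "[REDACTED_AWS_KEY_ID]"),
    (re.compile(r"\b[0-9a-fA-F]{32,}\b"), "[REDACTED_TOKEN]"),
    (re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"), "[REDACTED_EMAIL]"),
]


def redact(text: str) -> str:
    """Apply each redaction pattern in turn and return the scrubbed text."""
    for pattern, replacement in _REDACTIONS:
        text = pattern.sub(replacement, text)
    return text


if __name__ == "__main__":
    # Invented example: a key ID and an email that should never reach a prompt.
    snippet = "api_key = 'AKIAABCDEFGHIJKLMNOP'  # contact ops@example.com"
    print(redact(snippet))
```

Treating this as a pre-prompt filter keeps the "prompts are like logs" rule mechanical instead of relying on everyone remembering it under deadline.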
What about training new reviewers—does AI help or hurt?
It can help by:
- offering explanations in different words when something is confusing
- suggesting test cases that a junior reviewer might not think of
It can hurt if reviewers lean on it instead of building their own mental models.
We coach new reviewers to:
- write their own summaries of changes first
- use tools to double-check or broaden that summary, not to replace it
Takeaways
- AI assistants can make code review more effective, but only if humans stay in charge of judgment.
- Scoped, question-driven use works better than "ask it to review everything."
- Security and performance-sensitive work still requires our existing review practices.
- Prompts and outputs should be treated with the same care as any other stored artifact.