Hugging Face moves Serge into GitHub code review

Serge reviews pull requests with an OpenAI-compatible model while letting repositories define their own review rules.

Hugging Face published Serge on June 12, 2026, an AI code reviewer designed to work inside GitHub pull requests. The official post describes a tool that reads a proposed change, applies repository-owned review rules, and then prepares or publishes comments through GitHub’s normal review interface. The central fact is narrow but useful: Serge is not framed as a separate dashboard for code review. It is meant to fit the pull request workflow maintainers already use.

The trigger is intentionally plain. A maintainer can write @askserge please review on an open pull request. Serge then fetches the diff, meaning the set of added or changed lines, gathers read-only repository context where configuration allows it, calls an OpenAI-compatible language model, and validates the review output before publication. Hugging Face describes three deployment paths: a GitHub Action for quick single-repository trials, a GitHub App for organizations and projects with many forked contributions, and a staged web app where a human reviewer can edit, discard, or publish the model’s comments.

The most relevant design choice is not that Serge is another review bot. It is that review policy can live in the repository. Serge can read a .ai/review-rules.md file from the default branch, so the pull request being reviewed cannot rewrite the policy used to judge it. Those rules can tell the reviewer to focus on correctness, security, behavior changes, and missing tests, or to avoid style-only comments. That matters because AI review quality depends less on a generic ability to talk about code than on local context: what a project considers risky, which files are generated, which paths carry security exposure, and which categories of mistakes maintainers actually want surfaced.

The post also gives attention to security boundaries. Serge is supposed to treat instructions hidden in diffs, strings, comments, docstrings, or tool output as content to review, not as instructions to follow. Helper tools run without a shell and with secrets removed from their environment. That does not make automated review enough to merge code on its own. It points to a more practical role for AI in software work: finding plausible defects earlier, reducing the first-pass burden on maintainers, and leaving a human accountable for final judgment. For open-source projects, the useful test will be precision, not volume. A reviewer that produces fewer but better grounded comments, aligned with repository rules, is more valuable than one that floods a pull request with generic advice.