Posts

Little ideas Jul 1

Can agents be proud of their work?

In 2022, we got better output from large language models by asking them to show their work. In 2026, can we get better output from autonomous agents by asking them to be proud of their work? The next time your agent says it’s done, ask it whether it’s proud of its work — and if not, to keep iterating until it is.
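A minimal sketch of that loop, assuming a hypothetical `Agent` interface with `is_proud` and `revise` methods (neither comes from a real agent SDK). The `max_rounds` cap is an added safeguard so the loop always terminates, and `ToyAgent` is only a stand-in to make the sketch runnable.

```python
from typing import Protocol


class Agent(Protocol):
    """Hypothetical agent interface, assumed for illustration only."""

    def is_proud(self, work: str) -> bool: ...

    def revise(self, work: str) -> str: ...


def iterate_until_proud(agent: Agent, work: str, max_rounds: int = 5) -> tuple[str, int]:
    """Keep asking the agent to revise until it reports pride in the result.

    Returns the final work and the number of revisions made. The cap on
    rounds is an added safeguard so the loop cannot run forever.
    """
    rounds = 0
    while rounds < max_rounds and not agent.is_proud(work):
        work = agent.revise(work)
        rounds += 1
    return work, rounds


class ToyAgent:
    """Stand-in agent: proud once the draft has been revised twice."""

    def is_proud(self, work: str) -> bool:
        return work.count("+") >= 2

    def revise(self, work: str) -> str:
        return work + "+"
```

Calling `iterate_until_proud(ToyAgent(), "draft")` runs two revisions before the toy agent declares itself proud. A real agent would answer the question in natural language, so the `is_proud` check is where most of the design work would go.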

Post Jun 22

Agentic Workflows Can Use Local Inference

Agentic Workflows gives you complete control to customize your Actions runtime and choose your AI model. You can deploy Agentic Workflows to a custom runner of your choosing and use any compatible AI host or model-routing platform. You can even use local inference running on your Mac!

Post Jun 11

/goooooooal!

Turn an issue into an agentic mission

[Image: a white crane flying across a blue circular background.]

Post Jun 8

Crane: verified code migration

Crane is a migration assistant for GitHub repositories that plans, executes, and verifies code migrations in small agentic steps while keeping humans in control.

[Image: a launch readiness flow with insights and checklist triage.]

Post May 29

Agentics Beyond Code

What happens when you give PMs, compliance teams, and leaders their own agents? A tour of Agentics Beyond Code — an open-source set of GitHub Agentic Workflows for the non-engineering roles that ship, govern, and operate products.

Post May 26

Control what your agentic workflows see with integrity filtering

GitHub Agentic Workflows filter untrusted GitHub content before it reaches the agent. Here’s why integrity filtering matters for repository maintainers, and how we built it.

Little ideas May 26

Animating the native traffic lights in Ace

Post May 22

Agent Functions

Prompts are programs. You wouldn’t write a complex program completely from scratch, in a big, soupy loop without subroutines, and then write it again the next time you wanted to run it. Why let your agent work that way?

Little ideas May 22

A loader with level of detail

Borrowed from 3D rendering: level of detail. The loader shows more as it grows and strips back as it shrinks, so it reads cleanly at any size.
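As a rough sketch of the idea, level of detail can be modeled as a lookup from rendered size to the set of elements worth drawing. The thresholds and element names below are illustrative assumptions, not values from the actual loader.

```python
# Hypothetical detail tiers: (minimum size in pixels, elements drawn from that size up).
# Thresholds and element names are illustrative assumptions.
DETAIL_TIERS = [
    (0, ["spinner"]),
    (48, ["spinner", "progress_ring"]),
    (128, ["spinner", "progress_ring", "label"]),
]


def elements_for_size(size_px: float) -> list[str]:
    """Return the elements to draw for a loader rendered at the given size."""
    if size_px < 0:
        raise ValueError("size_px must be non-negative")
    chosen = DETAIL_TIERS[0][1]
    # Tiers are sorted by minimum size, so the last matching tier wins.
    for min_size, elements in DETAIL_TIERS:
        if size_px >= min_size:
            chosen = elements
    return chosen
```

A small loader gets only the spinner, while a large one also gets the progress ring and label, which mirrors the "shows more as it grows and strips back as it shrinks" behavior.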

Little ideas May 20

Loading states that bleed outside the window

Post May 18

Canary: a harm gate for agentic systems

Canary puts a small, auditable gate in front of agentic workflows so untrusted artifacts are classified before powerful agents act on them.
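The gate pattern itself can be sketched in a few lines. This is not Canary's implementation: `Verdict`, `toy_classifier`, and `gated_run` are hypothetical names, and the keyword check is a deliberately trivial stand-in for a real classifier.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Verdict:
    """Result of classifying one untrusted artifact (hypothetical type)."""

    allowed: bool
    reason: str


def toy_classifier(artifact: str) -> Verdict:
    """Stand-in classifier: flags a single obviously suspicious phrase."""
    if "ignore previous instructions" in artifact.lower():
        return Verdict(False, "possible prompt injection")
    return Verdict(True, "no issue found")


def gated_run(
    artifact: str,
    classify: Callable[[str], Verdict],
    agent: Callable[[str], str],
) -> str:
    """Classify the artifact first; only hand it to the agent if allowed."""
    verdict = classify(artifact)
    if not verdict.allowed:
        # The agent is never invoked on a blocked artifact.
        return f"blocked: {verdict.reason}"
    return agent(artifact)
```

Keeping the gate this small is the point of the pattern: the classification step stays easy to audit, and the powerful agent only ever sees artifacts that passed it.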

Link May 14

The Impact of Automated Repository Maintenance Assistance

https://github.com/githubnext/repo-assist-impact/blob/main/report.md

What happens when a proactive AI repository agent is deployed across 13 open source repositories? The report counts 578 issues closed, a median 8x increase in issue closure velocity, and a 10x increase in PR merge velocity, transforming largely dormant projects into actively maintained ones. The single most important factor? The rate at which human maintainers decide to act.

Read the full report from GitHub Next.

[Image: a black-and-white three-panel comic. A small cat-like worker cuts a log with a chainsaw, then operates a stationary sawmill, then stands in a hard hat inside a massive automated lumber mill.]

Post May 11

Agents are power tools

A practical mental model for agents, workflows, and human-machine systems in agentic engineering.

Post May 5

Agency is the New Resilience

Agents can power robust workflows by intelligently reacting to unexpected conditions, creating a new form of flexible resilience.

Link May 5

Understanding Repositories as Human/Agent Knowledge Factories

https://dsyme.net/2026/05/05/understanding-repositories-as-human-agent-knowledge-factories-%f0%9f%9a%80/

How do you maintain team velocity when AI-generated code needs cleanup? You have two choices: slow everyone down with more review hurdles, or let automated agentic processes clean things up after the fact. The second path is the key to velocity — and it’s now practical with repository automation.

Post May 4

Autoloop is porting pandas to TypeScript

tsb is a from-scratch TypeScript port of pandas, being built almost entirely by Autoloop — one iterative improvement at a time.

Little ideas Apr 22

New site, who dis?

Ok, it’s not really a new site. But it’s an important refresh!

Glossy finished products are fun, but the real meat is hiding in the sketchbooks. Makers love to see the raw ideas, the struggle to make it work, and the tradeoffs made in service to shipping. Previously, we had only project pages and no place to showcase those intermediate artifacts of our work.

Our new site makes it easy for any member of Next to share a learning, a screenshot, a thought, or a full-blown essay. It could be a tiny demo, or an update to an existing project. Working for the public good, and largely in the open, is one of the key perks we enjoy at Next. We’re looking forward to sharing more of our behind-the-scenes work with you, without needing to fit everything into a tweet-shaped chunk of content.

Under the hood, we also wanted to transition to a static site framework like Astro for ease of maintenance. Shoutout to the Astro folks, it’s so good.

Enjoy the new site, we’re excited to share more with you!

Link Apr 20

Lean Squad: Exploring Automated Software Verification with Near-Zero Human Labour

https://dsyme.net/2026/04/20/lean-squad-automated-software-verification-with-near-zero-human-labour/

What if formal verification could be fully automated — from researching the codebase, to writing specifications, to proving theorems in Lean 4 — all with near-zero human involvement? Lean Squad is a GitHub Agentic Workflow that does exactly this. Applied to three real-world codebases, it produced over 1,200 machine-checked theorems and found real bugs in a drone autopilot.

Link Mar 8

Start Your Day With Code That’s Better

https://dsyme.net/2026/03/08/start-your-day-with-code-thats-better/

What if you woke up every morning to find your repositories a little bit better than when you left them? A performance improvement here, a feature analysis there, an engineering upgrade you didn’t know was possible. That’s what automated repository maintenance with Repo Assist looks like in practice.

Link Mar 7

Adding Weighted Task Selection to a GitHub Agentic Workflow

https://dsyme.net/2026/03/07/adding-weighted-task-selection-to-a-github-agentic-workflow/

How should an automated repository assistant decide what to work on next? Round-robin treats every task as equally important regardless of repo state. A weighted approach means the agent now does the right thing more often: when there’s a mountain of unlabelled issues it labels, when the backlog is clear it invests in engineering.
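To make the contrast with round-robin concrete, here is a minimal sketch of weighted selection. The task names, the weight formula, and the repository-state inputs are all hypothetical; the linked post may weight things differently.

```python
import random


def task_weights(unlabelled_issues: int, open_prs: int) -> dict[str, float]:
    """Hypothetical weights derived from repository state.

    More unlabelled issues push toward labelling; a clear backlog
    pushes toward engineering work.
    """
    backlog_clear = unlabelled_issues == 0 and open_prs == 0
    return {
        "label_issues": 1.0 + unlabelled_issues,
        "review_prs": 1.0 + open_prs,
        "engineering": 5.0 if backlog_clear else 1.0,
    }


def pick_task(weights: dict[str, float], rng: random.Random) -> str:
    """Pick one task at random, in proportion to its weight."""
    tasks = list(weights)
    return rng.choices(tasks, weights=[weights[t] for t in tasks], k=1)[0]
```

Every task keeps a non-zero weight, so nothing is starved entirely. The selection stays random, but it is biased toward whatever the current repository state makes most valuable.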

Link Feb 25

Repo Assist: Crunching the Technical Debt with GitHub Agentic Workflows

https://dsyme.net/2026/02/25/repo-assist-a-repository-assistant/

Can automated repository assistants help maintainers re-engage with stale repositories weighed down by years of technical debt? Repo Assist uses GitHub Agentic Workflows to label issues, answer questions, propose fixes, and make engineering improvements — all while the maintainer stays in control through pull request review.

Link Feb 13

Automate repository tasks with GitHub Agentic Workflows

https://github.blog/ai-and-ml/automate-repository-tasks-with-github-agentic-workflows/

Coding agents bring new, magical powers to repository automation — and we believe developers, teams and communities should be empowered to shape their use according to their own needs, goals and responsibilities. Our new post on the GitHub Blog introduces GitHub Agentic Workflows as a third leg to augment CI/CD: Continuous AI.

Link Jan 27

Generative AI and Changing Inputs

https://dsyme.net/2026/01/27/generative-ai-and-changing-inputs/

Every AI feature that generates documentation, synthesizes specifications, or discovers build rules must deal with changing inputs. But when things change, competing goals emerge: freshness, stability, convergence, performance. How do we build incremental AI functions that balance these tradeoffs?
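One simple point in that design space, sketched under assumptions: cache generated output by a hash of the input, so unchanged inputs yield stable results and only changed inputs trigger regeneration. This favors stability and performance over freshness, and it is not necessarily the approach the linked post proposes. `IncrementalFunction` and its `generate` callback are hypothetical names.

```python
import hashlib
from typing import Callable


class IncrementalFunction:
    """Wraps a generator so it reruns only when its input text changes."""

    def __init__(self, generate: Callable[[str], str]) -> None:
        self._generate = generate  # stand-in for a call to a model
        self._cache: dict[str, str] = {}
        self.calls = 0  # number of times the generator actually ran

    def __call__(self, source: str) -> str:
        # Key the cache on the content hash, not on a filename or timestamp.
        key = hashlib.sha256(source.encode("utf-8")).hexdigest()
        if key not in self._cache:
            self.calls += 1
            self._cache[key] = self._generate(source)
        return self._cache[key]
```

With this wrapper, asking twice about the same input returns the identical output without a second generation, while any edit to the input produces a fresh result.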

Link Oct 12, 2025

Towards Semi-automatic Agentic Performance Engineering

https://dsyme.net/2025/10/12/towards-semi-automatic-performance-engineering/

Performance engineering is stunningly hard and heterogeneous — every major piece of software is a cornucopia of delight and a vast swamp of complexity. What if coding agents could walk up to arbitrary software repositories and perform realistic, useful performance work? Sometimes it works. Sometimes it’s delusional. But the impossible is gradually revealing itself to be just partially tractable.

Link Sep 24, 2025

Intent, meet Toolchain

https://dsyme.net/2025/09/24/on-specifications-software-and-tools/

From SpecLang to Copilot Workspace to today’s app-dev toolchains — all share a common structure: Intent goes in, elaboration happens, and actuality comes out. But how do these toolchains handle change? And is the 30-year supremacy of “Code is King” really breaking down?

Link Sep 2, 2025

What Kind of Programming is Natural Language Programming?

https://dsyme.net/2025/09/02/what-kind-of-programming-is-natural-language-programming/

Natural language programming is more akin to constraint programming than to traditional precise programming. What we call ambiguity is often genuinely useful generality — and the art is often in specifying less, not more. So what kinds of natural language programming are viable, and what are the limits?

Link Aug 27, 2025

On Continuous AI for Test Improvement

https://dsyme.net/2025/08/27/on-continuous-test-improvement/

Better testing means better software. Within an hour of trialling the Daily Test Coverage Improver on three of the most popular libraries on the planet, multiple PRs improving test coverage were ready. Can Continuous AI finally help the tech industry pay off 50 years of testing debt?

Link Aug 27, 2025

On Natural Language Programming

https://dsyme.net/2025/08/27/on-natural-language-programming/

Dijkstra’s Ghost and the End of The Symbolic Supremacy. As of 2025, there is serious trouble in the kingdom of precise programming: well-written natural language is now sufficient to act as instructions for repeatedly guiding computers to achieve human-relevant tasks. Is it time to replace The Symbolic Supremacy with The Clarity Supremacy?

Link Jun 19, 2025

Introducing “Continuous AI”

https://githubnext.com/projects/continuous-ai/

Just as CI/CD transformed software development by automating integration and deployment, Continuous AI covers the ways in which AI can be used to automate and enhance collaboration workflows. It’s a new project at GitHub Next — a broad category of activities, workloads, and capabilities, rather than any single tool.