Add performance-benchmark skill for ad hoc benchmarking with EgorBot #123319

Copilot · 2026-01-17T22:29:50Z

Description

Adds a Copilot skill that enables ad hoc performance benchmarks to be run on PRs using EgorBot.

The skill teaches Copilot to:

Write BenchmarkDotNet benchmarks following best practices from the Microbenchmark Design Guidelines
Post PR comments that trigger EgorBot with appropriate flags
Use profiling, JIT disassembly, and commit comparison options

Example invocation generated by the skill:

@EgorBot -x64 -arm -profiler --envvars DOTNET_JitDisasm:SumArray

```cs
using System.Linq;
using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;

BenchmarkSwitcher.FromAssembly(typeof(Bench).Assembly).Run(args);

public class Bench
{
    private int[] _data = Enumerable.Range(0, 1000).ToArray();

    [Benchmark]
    public int SumArray() => _data.Sum();
}


Skill location: `.github/skills/performance-benchmark/SKILL.md`

**Best Practices covered:**
- Move initialization to `[GlobalSetup]` to avoid measuring allocation overhead
- Return values from benchmark methods to prevent dead code elimination
- Avoid manual loops (BenchmarkDotNet handles iteration automatically)
- Focus on common cases, not edge cases
- Benchmark class requirements (public, not sealed, not static, must be a class)
- No `[Benchmark(Baseline = true)]` needed since EgorBot compares builds directly

**Target selection guidance:**
- `-x64` for quick verification of non-architecture/non-OS specific changes
- `-x64 -arm` when ARM might behave differently (recommended default)
- `-windows_x64` for Windows-specific testing
- `-intel` noted as more flaky due to JCC Erratum and loop alignment sensitivity

<!-- START COPILOT CODING AGENT SUFFIX -->



<!-- START COPILOT ORIGINAL PROMPT -->



<details>

<summary>Original prompt</summary>

> Create a skill following the instructions at https://docs.github.com/en/copilot/concepts/agents/about-agent-skills#creating-and-adding-skills. The skill should enable ad hoc performance benchmarks to be run, such that if a change needs to have its performance validated, a benchmark can be written and then submitted for evaluation. The bot for submitting benchmarks for evaluation is documented at https://gist.github.com/EgorBo/e73bd616303bfa3782e8baa74c247b23. The skill should generate a benchmark to validate the operation being tested and post a comment to the pr that will trigger egorbot to evaluate that benchmark.


</details>



<!-- START COPILOT CODING AGENT TIPS -->
---

💬 We'd love your input! Share your thoughts on Copilot coding agent in our [2 minute survey](https://gh.io/copilot-coding-agent-survey).

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot

Pull request overview

This PR adds a Copilot skill that enables developers to run ad hoc performance benchmarks on pull requests using EgorBot. The skill teaches Copilot how to write BenchmarkDotNet benchmarks following best practices and format PR comments that trigger EgorBot with appropriate flags for profiling, disassembly, and commit comparison.

Changes:

New skill documentation file that provides comprehensive guidance on writing BenchmarkDotNet benchmarks
EgorBot integration instructions including target platform flags, profiling options, and environment variable configuration
Multiple example benchmarks and command invocations covering common scenarios

stephentoub · 2026-01-18T15:06:43Z

@copilot, please remove this if block and then measure the performance impact of the change using the benchmarking skill

runtime/src/libraries/System.Linq/src/System/Linq/AnyAll.cs

Line 18 in daa023e

if (source is ICollection<TSource> gc)

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot · 2026-01-18T15:17:33Z

@copilot, please remove this if block and then measure the performance impact of the change using the benchmarking skill
https://github.com/dotnet/runtime/blob/daa023eca0091618712fe065feb4e3b80c02e1a...

Removed the ICollection<TSource> check in commit 4059503. To measure the performance impact:

@EgorBot -intel -arm

using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;
using System.Collections.Generic;

BenchmarkSwitcher.FromAssembly(typeof(Bench).Assembly).Run(args);

[MemoryDiagnoser]
public class Bench
{
    private List<int> _list = default!;
    private int[] _array = default!;
    private HashSet<int> _hashSet = default!;

    [Params(0, 1, 100)]
    public int Count { get; set; }

    [GlobalSetup]
    public void Setup()
    {
        _list = Enumerable.Range(0, Count).ToList();
        _array = Enumerable.Range(0, Count).ToArray();
        _hashSet = Enumerable.Range(0, Count).ToHashSet();
    }

    [Benchmark]
    public bool AnyList() => _list.Any();

    [Benchmark]
    public bool AnyArray() => _array.Any();

    [Benchmark]
    public bool AnyHashSet() => _hashSet.Any();
}

stephentoub · 2026-01-18T15:20:13Z

@EgorBo, what needs to be different for the bot to kick in?

stephentoub · 2026-01-18T15:27:28Z

Also, @EgorBo, could we modify the bot to post back to the original comment and include the username when it's done its work? If that's copilot, it should then in theory enable an end to end workflow where copilot invokes the bot and the bot then invokes copilot, assuming the bot has permissions that copilot respects.

adamsitnik

@copilot Please address my feedback

.github/skills/performance-benchmark/SKILL.md

Co-authored-by: adamsitnik <6011991+adamsitnik@users.noreply.github.com>

Copilot · 2026-01-19T13:13:16Z

@copilot Please address my feedback

Addressed in 3a84d8d. Updated the Best Practices section based on the Microbenchmark Design Guidelines and removed manual loops from examples.

MihaZupan · 2026-01-19T13:42:52Z

what needs to be different for the bot to kick in?

At risk of putting words in Egor's mouth, our bots have allow-lists for who can trigger such commands. I imagine the logic there would have to change to look at who triggered the Copilot PR instead of just the comment author (or just that it is a copilot PR since that already implies a team member).

assuming the bot has permissions that copilot respects

The bots are currently effectively a random account with no non-public permissions to these repos.

Just throwing the idea out there, could we expose this as an MCP server / API instead of going through comments?
That way Copilot may be able to iterate easier without the extra comments + the skill/instructions/etc may be immediately reusable for non-PR local workflows.

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

stephentoub · 2026-01-26T14:52:45Z

@copilot, please remove this if block and then benchmark the change:

runtime/src/libraries/System.Linq/src/System/Linq/AnyAll.cs

Line 18 in daa023e

if (source is ICollection<TSource> gc)

…rking Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot · 2026-01-26T15:02:55Z

@copilot, please remove this if block and then benchmark the change:
https://github.com/dotnet/runtime/blob/daa023eca0091618712fe065feb4e3b80c02e1ac/src/libraries/System.Linq/src/System/Linq/AnyAll.c...

Removed the ICollection<TSource> check in commit 74bc8d1. To benchmark the change:

@EgorBot -x64 -arm

using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;
using System.Collections.Generic;

BenchmarkSwitcher.FromAssembly(typeof(Bench).Assembly).Run(args);

[MemoryDiagnoser]
public class Bench
{
    private List<int> _list = default!;
    private int[] _array = default!;
    private HashSet<int> _hashSet = default!;

    [Params(0, 1, 100)]
    public int Count { get; set; }

    [GlobalSetup]
    public void Setup()
    {
        _list = Enumerable.Range(0, Count).ToList();
        _array = Enumerable.Range(0, Count).ToArray();
        _hashSet = Enumerable.Range(0, Count).ToHashSet();
    }

    [Benchmark]
    public bool AnyList() => _list.Any();

    [Benchmark]
    public bool AnyArray() => _array.Any();

    [Benchmark]
    public bool AnyHashSet() => _hashSet.Any();
}

stephentoub · 2026-01-26T15:07:11Z

@EgorBo, is the bot working correctly? I was expecting the above comment to be picked up.

EgorBo · 2026-01-26T15:20:39Z

@EgorBo, is the bot working correctly? I was expecting the above comment to be picked up.

hm.. it was supposed to be, just checked via EgorBot/runtime-utils#599

Let me see the logs, I wonder if certain github APIs just don't return comments left by bots

EgorBo · 2026-01-26T15:21:14Z

@copilot2, please remove this if block and then benchmark the change:
https://github.com/dotnet/runtime/blob/daa023eca0091618712fe065feb4e3b80c02e1ac/src/libraries/System.Linq/src/System/Linq/AnyAll.c...

Removed the ICollection<TSource> check in commit 74bc8d1. To benchmark the change:

@EgorBot -x64 -arm

using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;
using System.Collections.Generic;

BenchmarkSwitcher.FromAssembly(typeof(Bench).Assembly).Run(args);

[MemoryDiagnoser]
public class Bench
{
    private List<int> _list = default!;
    private int[] _array = default!;
    private HashSet<int> _hashSet = default!;

    [Params(0, 1, 100)]
    public int Count { get; set; }

    [GlobalSetup]
    public void Setup()
    {
        _list = Enumerable.Range(0, Count).ToList();
        _array = Enumerable.Range(0, Count).ToArray();
        _hashSet = Enumerable.Range(0, Count).ToHashSet();
    }

    [Benchmark]
    public bool AnyList() => _list.Any();

    [Benchmark]
    public bool AnyArray() => _array.Any();

    [Benchmark]
    public bool AnyHashSet() => _hashSet.Any();
}

EgorBo · 2026-01-26T18:06:42Z

@copilot please try running the benchmark again

Copilot · 2026-01-26T18:12:50Z

@copilot please try running the benchmark again

@EgorBot -x64 -arm

using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;
using System.Collections.Generic;

BenchmarkSwitcher.FromAssembly(typeof(Bench).Assembly).Run(args);

[MemoryDiagnoser]
public class Bench
{
    private List<int> _list = default!;
    private int[] _array = default!;
    private HashSet<int> _hashSet = default!;

    [Params(0, 1, 100)]
    public int Count { get; set; }

    [GlobalSetup]
    public void Setup()
    {
        _list = Enumerable.Range(0, Count).ToList();
        _array = Enumerable.Range(0, Count).ToArray();
        _hashSet = Enumerable.Range(0, Count).ToHashSet();
    }

    [Benchmark]
    public bool AnyList() => _list.Any();

    [Benchmark]
    public bool AnyArray() => _array.Any();

    [Benchmark]
    public bool AnyHashSet() => _hashSet.Any();
}

stephentoub · 2026-01-26T18:16:35Z

Sweet.

@copilot please revert the LINQ commit.

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot · 2026-01-26T18:20:26Z

Sweet.

@copilot please revert the LINQ commit.

Reverted in 065b524.

stephentoub · 2026-01-26T18:22:32Z

/ba-g markdown only

Initial plan

5ab11c8

Copilot AI assigned Copilot and stephentoub Jan 17, 2026

Copilot started work on behalf of stephentoub January 17, 2026 22:30 View session

Copilot AI and others added 2 commits January 17, 2026 22:35

Add performance-benchmark skill for ad hoc benchmarking with EgorBot

cc0c486

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Fix missing System.Linq using statements in benchmark examples

b01040b

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add skill for ad hoc performance benchmarks~~ Add performance-benchmark skill for ad hoc benchmarking with EgorBot Jan 17, 2026

Copilot AI requested a review from stephentoub January 17, 2026 22:37

Copilot finished work on behalf of stephentoub January 17, 2026 22:37

stephentoub marked this pull request as ready for review January 17, 2026 22:44

stephentoub requested review from EgorBo, adamsitnik and Copilot January 17, 2026 22:44

Copilot started reviewing on behalf of stephentoub January 17, 2026 22:45 View session

Copilot AI reviewed Jan 17, 2026

View reviewed changes

Copilot started work on behalf of stephentoub January 18, 2026 15:07 View session

Remove ICollection<TSource> check from Any() method

4059503

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot finished work on behalf of stephentoub January 18, 2026 15:19

build-analysis bot mentioned this pull request Jan 18, 2026

browser-wasm linux LibraryTests build failure - NETSDK1022: Duplicate 'Content' items #123237

Closed

adamsitnik reviewed Jan 19, 2026

View reviewed changes

.github/skills/performance-benchmark/SKILL.md Outdated Show resolved Hide resolved

.github/skills/performance-benchmark/SKILL.md Outdated Show resolved Hide resolved

Copilot started work on behalf of adamsitnik January 19, 2026 13:07 View session

Update Best Practices based on Microbenchmark Design Guidelines

3a84d8d

Co-authored-by: adamsitnik <6011991+adamsitnik@users.noreply.github.com>

Copilot AI requested a review from adamsitnik January 19, 2026 13:14

Copilot finished work on behalf of adamsitnik January 19, 2026 13:14

Copilot started work on behalf of stephentoub January 26, 2026 14:05 View session

Update target flags guidance based on EgorBo's recommendations

1775846

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot finished work on behalf of stephentoub January 26, 2026 14:11

Copilot started work on behalf of stephentoub January 26, 2026 14:53 View session

Remove ICollection<TSource> check from Any() method for benchma…

74bc8d1

…rking Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot finished work on behalf of stephentoub January 26, 2026 15:03

EgorBot mentioned this pull request Jan 26, 2026

Benchmarks for #123319 (EgorBo) EgorBot/runtime-utils#600

Open

Copilot started work on behalf of EgorBo January 26, 2026 18:09 View session

EgorBot mentioned this pull request Jan 26, 2026

Benchmarks for #123319 (Copilot) EgorBot/runtime-utils#601

Open

Copilot AI requested a review from EgorBo January 26, 2026 18:13

Copilot finished work on behalf of EgorBo January 26, 2026 18:13

Copilot started work on behalf of stephentoub January 26, 2026 18:17 View session

Revert ICollection<TSource> check removal in Any() method

065b524

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot finished work on behalf of stephentoub January 26, 2026 18:21

stephentoub approved these changes Jan 26, 2026

View reviewed changes

stephentoub merged commit 861226a into main Jan 26, 2026
18 of 19 checks passed

github-project-automation bot moved this to Done in Runtime Infra Jan 26, 2026

stephentoub deleted the copilot/add-performance-benchmark-skill branch January 26, 2026 18:22

This was referenced Jan 27, 2026

[main] Source code updates from dotnet/runtime dotnet/dotnet#4437

Merged

[release/11.0.1xx-preview1] Source code updates from dotnet/runtime dotnet/dotnet#4440

Merged

Add performance-benchmark skill for ad hoc benchmarking with EgorBot #123319

Add performance-benchmark skill for ad hoc benchmarking with EgorBot #123319

Conversation

Copilot AI commented Jan 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

stephentoub commented Jan 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI commented Jan 18, 2026

Uh oh!

stephentoub commented Jan 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stephentoub commented Jan 18, 2026

Uh oh!

adamsitnik left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI commented Jan 19, 2026

Uh oh!

MihaZupan commented Jan 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stephentoub commented Jan 26, 2026

Uh oh!

Copilot AI commented Jan 26, 2026

Uh oh!

stephentoub commented Jan 26, 2026

Uh oh!

EgorBo commented Jan 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

EgorBo commented Jan 26, 2026

Uh oh!

EgorBo commented Jan 26, 2026

Uh oh!

Copilot AI commented Jan 26, 2026

Uh oh!

stephentoub commented Jan 26, 2026

Uh oh!

Copilot AI commented Jan 26, 2026

Uh oh!

stephentoub commented Jan 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Copilot AI commented Jan 17, 2026 •

edited

Loading

stephentoub commented Jan 18, 2026 •

edited

Loading

stephentoub commented Jan 18, 2026 •

edited

Loading

MihaZupan commented Jan 19, 2026 •

edited

Loading

EgorBo commented Jan 26, 2026 •

edited

Loading