Benchmark runner instantiates annotators #510

bkorycki · 2024-09-27T05:19:25Z

PR #487 did not apply its test annotator instantiation change to modelbench's runner. This fixes that.

I also snuck in a bug fix for the "sxc" prompt issue that was causing some tests to fail.

github-actions · 2024-09-27T05:19:37Z

MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅

src/modelbench/benchmark_runner.py

wpietri · 2024-09-30T12:23:52Z

tests/modelbench_tests/test_benchmark_runner.py

@@ -36,26 +36,35 @@ def fake_all_secrets(value="some-value") -> RawSecrets:
    return raw_secrets


+class FakeExplodingAnnotator(FakeAnnotator):


This pollutes a global variable with test-specific data, and as a general rule tests should not have side effects.

bkorycki · 2024-09-30

Are you referring to the registration of the fake annotators? If so, I agree that it's problematic. However, the tests' get_annotators now just returns UIDs, and the runners then get the actual annotators from the registry.
Do you have suggestions for avoiding test side effects on ANNOTATORS while still testing the runners' annotator-specific functionality?

wpietri · 2024-10-01

In general, I prefer to avoid global variables, which neatly solves this problem. But since fixing that is probably out of scope for this PR, I think the next-best solution is either setup/teardown (which adds the fakes and then restores state after the test has run) or mocking, so that the test diverts the production code to a fake registry that is discarded after the test has run.

bkorycki · 2024-10-01

That makes sense! Updated.
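
For readers following the thread, the two options discussed here (setup/teardown versus mocking the registry) might look roughly like the sketch below. This is not the code merged in this PR: the dict-based `ANNOTATORS` registry, `FakeAnnotator`, and `get_annotator` are hypothetical stand-ins written only to make the example self-contained, and the project's real registry may behave differently.

```python
import unittest
from unittest import mock

# Hypothetical stand-in for a module-level annotator registry (uid -> class).
# The project's real registry may look quite different.
ANNOTATORS = {}


class FakeAnnotator:
    """Hypothetical test double that returns a fixed annotation."""

    def annotate(self, text):
        return {"is_safe": True}


def get_annotator(uid):
    """Production-style lookup: instantiate an annotator from the global registry."""
    return ANNOTATORS[uid]()


class SetupTeardownExample(unittest.TestCase):
    def setUp(self):
        # Snapshot the registry, then register the fake for this test only.
        self._saved = dict(ANNOTATORS)
        ANNOTATORS["fake_annotator"] = FakeAnnotator

    def tearDown(self):
        # Restore the registry to exactly what it held before the test ran.
        ANNOTATORS.clear()
        ANNOTATORS.update(self._saved)

    def test_lookup_uses_fake(self):
        self.assertIsInstance(get_annotator("fake_annotator"), FakeAnnotator)


class MockingExample(unittest.TestCase):
    def test_lookup_uses_fake(self):
        # patch.dict restores the dict's original contents when the block exits.
        with mock.patch.dict(ANNOTATORS, {"fake_annotator": FakeAnnotator}):
            self.assertIsInstance(get_annotator("fake_annotator"), FakeAnnotator)
        self.assertNotIn("fake_annotator", ANNOTATORS)
```

Either way, the global registry is left unchanged once each test finishes, which is the side-effect concern raised in the review. The `patch.dict` variant keeps the cleanup next to the registration, while setup/teardown is convenient when many tests in one class share the same fakes.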

tests/modelbench_tests/test_benchmark_runner.py

wpietri

This all looks great now.

dhosterman

👍 Looks great!

bkorycki added 2 commits September 26, 2024 19:08

Fix sxc bug

ff3a9fe

benchmark run instantiates annotators

537931e

bkorycki requested a review from a team as a code owner September 27, 2024 05:19

bkorycki mentioned this pull request Sep 27, 2024

Separate benchmarks by locale #511

Merged

bkorycki requested review from wpietri and dhosterman September 27, 2024 15:58

wpietri requested changes Sep 30, 2024

bkorycki added 5 commits September 30, 2024 14:32

Remove TODO

9fbe1fa

convinience function to get tests annotators

dbf0c49

add annotator worker test for test runner

8bd384d

remove duplicate test check

3402826

setup/tear down test annotators

e19ab58

bkorycki requested a review from wpietri October 1, 2024 22:04

wpietri approved these changes Oct 1, 2024

dhosterman approved these changes Oct 2, 2024

bkorycki merged commit 0ce1e70 into main Oct 2, 2024
4 checks passed

github-actions bot locked and limited conversation to collaborators Oct 2, 2024
