CompilerGym Release v0.2.0 #434

ChrisCummins · 2021-09-27T15:37:26Z

This release adds two new compiler optimization problems to CompilerGym: GCC command line flag optimization and CUDA loop nest optimization.

[GCC] A new gcc-v0 environment, authored by @hughleat, exposes the command line flags of GCC as a reinforcement learning environment. GCC is a production-grade compiler for C and C++ used throughout industry. The environment provides several datasets and a large, high dimensional action space that works on several GCC versions. For further details check out the reference documentation.
[loop_tool] A new loop_tool-v0 environment, authored by @bwasti, provides an experimental intermediate representation of n-dimensional data computation that can be lowered to both CPU and GPU backends. This provides a reinforcement learning environment for manipulating nests of loop computations to maximize throughput. For further details check out the reference documentation.

Other highlights of this release include:

[Docker] Published a chriscummins/compiler_gym docker image that can be used to run CompilerGym services in standalone isolated containers (#424).
[LLVM] Fixed a bug in the experimental Runtime observation space that caused observations to slow down over time (#398).
[LLVM] Added a new utility module to compute observations from bitcodes (#405).
Overhauled the continuous integration services to reduce computational requirements by 59.4% while increasing test coverage (#392).
Improved error reporting if computing an observation fails (#380).
Changed the return type of compiler_gym.random_search() to a CompilerEnv (#387).
Numerous other bug fixes and improvements.

Many thanks to code contributors: @thecoblack, @bwasti, @hughleat, and @sahirgomez1!

Replace the start/end/undo/step endpoints with a single "step" function that takes all of the variables needed to describe an environment state (then benchmark, reward signal, and full actions history). This replaces the session-based API which is error prone and hard to scale. Note that this new stateless API is only a proof-of-concept implementation, as on every "step" it replays an entire episode. In the future we will change this to maintain a pool of live environments that can be used to serve stateless API requests more efficiently.

…i-V4

Replace the start/end/undo/step endpoints with a single "step" function that takes all of the variables needed to describe an environment state (then benchmark, reward signal, and full actions history). This replaces the session-based API which is error prone and hard to scale. Note that this new stateless API is only a proof-of-concept implementation, as on every "step" it replays an entire episode. In the future we will change this to maintain a pool of live environments that can be used to serve stateless API requests more efficiently.

In addition to plotting reward history, the frontend also shows the trend of instcount and autophase features. This means that all_states=1 needs to return everything for that case.

merge chris commits

Release v0.1.10 (2021-09-08)

m4 is needed to build Csmith from source.

This patch improves the error reporting when computing an observation fails. First, if the service produces an unexpected number of observations, a ServiceError is raised, rather than the previous assertion. Second, if the environment reports that it has reached a terminal state, a ServiceError is raised, containing the error details produced by the environment.

Small documentation improvements for build dependencies

Improved error reporting from ObservationView.__getitem__().

This patch refactors the code pattern `try: ...; finally: env.close()` to instead use the `with gym.make(...):` pattern. This is preferred because it automatically handles calling `close()`.

Use `with` statement in place of try/finally for envs.

Regression introduced in #384.

[tests] Fix gym compatibility test.

This commit combines code from: Hugh Leather <hleather@fb.com> Chris Cummins <cummins@fb.com> It is mostly Hugh's work, with a small amount of fixes from Chris, and a couple of extra datasets. Issue #383.

Issue #383.

This can be useful for debugging services: $ COMPILER_GYM_DEBUG=4 python -m compiler_gym.bin.service --env=llvm-v0 --run_on_port=8000 Issue #318.

[loop_tool] Add integration and tests

This removes the `examples/` tests from the bazel build system. Instead, examples are tested by simply running pytest in the examples directory. The `make install-test` target still runs the examples tests. One exception is `examples/example_compiler_gym_service` which can only be run through bazel because of its use of compiled C++ code. Issue #412 will be used to track progress on this.

[ci] Add a codeql workflow for Python.

Add support for running CompilerGym environments from docker containers

Un-bazel-ify the examples

[env] Fix a bug in reset() failure handling.

codecov-commenter · 2021-09-27T15:52:33Z

Codecov Report

Merging #434 (f511941) into stable (e48d497) will decrease coverage by 5.63%.
The diff coverage is 88.82%.

@@            Coverage Diff             @@
##           stable     #434      +/-   ##
==========================================
- Coverage   85.87%   80.23%   -5.64%     
==========================================
  Files          87      104      +17     
  Lines        4757     5966    +1209     
==========================================
+ Hits         4085     4787     +702     
- Misses        672     1179     +507

Impacted Files	Coverage Δ
compiler_gym/bin/random_replay.py	`0.00% <0.00%> (ø)`
compiler_gym/bin/random_search.py	`0.00% <0.00%> (ø)`
compiler_gym/envs/llvm/llvm_benchmark.py	`42.85% <0.00%> (-44.47%)`	⬇️
compiler_gym/random_replay.py	`100.00% <ø> (ø)`
compiler_gym/service/__init__.py	`100.00% <ø> (ø)`
compiler_gym/service/proto/__init__.py	`100.00% <ø> (ø)`
compiler_gym/envs/llvm/compute_observation.py	`28.57% <28.57%> (ø)`
compiler_gym/envs/llvm/datasets/csmith.py	`55.88% <33.33%> (-32.36%)`	⬇️
compiler_gym/util/flags/benchmark_from_flags.py	`80.00% <40.00%> (+5.00%)`	⬆️
compiler_gym/bin/service.py	`76.27% <41.66%> (-2.17%)`	⬇️
... and 57 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e48d497...f511941. Read the comment docs.

@hughleat

This release adds two new compiler optimization problems to CompilerGym: GCC command line flag optimization and CUDA loop nest optimization. - [GCC] A new `gcc-v0` environment, authored by @hughleat, exposes the command line flags of GCC as a reinforcement learning environment. GCC is a production-grade compiler for C and C++ used throughout industry. The environment provides several datasets and a large, high dimensional action space that works on several GCC versions. For further details check out the reference documentation: https://facebookresearch.github.io/CompilerGym/envs/gcc.html - [loop_tool] A new `loop_tool-v0` environment, authored by @bwasti, provides an experimental intermediate representation of *n*-dimensional data computation that can be lowered to both CPU and GPU backends. This provides a reinforcement learning environment for manipulating nests of loop computations to maximize throughput. For further details check out the reference documentation: https://facebookresearch.github.io/CompilerGym/envs/loop_tool.html Other highlights of this release include: - [Docker] Published a chriscummins/compiler_gym docker image that can be used to run CompilerGym services in standalone isolated containers. - [LLVM] Fixed a bug in the experimental `Runtime` observation space that caused observations to slow down over time. - [LLVM] Added a new utility module to compute observations from bitcodes. - Overhauled the continuous integration services to reduce computational requirements by 59.4% while increasing test coverage. - Improved error reporting if computing an observation fails. - Changed the return type of compiler_gym.random_search() to a CompilerEnv. - Numerous other bug fixes and improvements. Many thanks to code contributors: @thecoblack, @bwasti, @hughleat, and @sahirgomez1!

We will be running the full CI workflow on every push / PR, so repeating every test on the build artifacts seems wasteful. Instead just run the examples tests.

ChrisCummins and others added 30 commits August 25, 2021 13:41

Merge remote-tracking branch 'chrisrepo/www-stateless' into www-ui-ap…

f96aa9f

…i-V4

[www] Return instcount and autophase history when all_states=1.

c89d3d6

In addition to plotting reward history, the frontend also shows the trend of instcount and autophase features. This means that all_states=1 needs to return everything for that case.

Refactors part of frontend code to use API V4

f16f209

added latest push

01eb239

fixed some api comments

d59bc24

Merge branch 'www-stateless' into www-ui-api-V4

7fe4a7e

merge chris commits

Merge pull request #378 from facebookresearch/stable

58b5831

Release v0.1.10 (2021-09-08)

Add m4 to list of build dependencies.

7e0cdbd

m4 is needed to build Csmith from source.

[docs] Small wording improvements to install instructions.

50bc503

[tests] Fix name of mock class.

32c8294

Merge pull request #379 from ChrisCummins/build-deps

c824115

Small documentation improvements for build dependencies

Merge pull request #380 from ChrisCummins/observation-error-message

5354ddd

Improved error reporting from ObservationView.__getitem__().

Use with statement in place of try/finally for envs.

12b4414

This patch refactors the code pattern `try: ...; finally: env.close()` to instead use the `with gym.make(...):` pattern. This is preferred because it automatically handles calling `close()`.

Merge pull request #384 from ChrisCummins/with-statement

58f4587

Use `with` statement in place of try/finally for envs.

Refactors frontend to use Flask API V4

befa6fb

adds spinner

ea2fa69

[tests] Fix gym compatibility test.

671dadc

Regression introduced in #384.

Merge pull request #390 from ChrisCummins/fix-regression

a8408e8

[tests] Fix gym compatibility test.

Add a new GCC environment.

4be09db

This commit combines code from: Hugh Leather <hleather@fb.com> Chris Cummins <cummins@fb.com> It is mostly Hugh's work, with a small amount of fixes from Chris, and a couple of extra datasets. Issue #383.

Remove hardcoded GCC name.

6afb7c9

Issue #383.

Fix specification of GCC binary.

435bcc7

Issue #383.

[gcc] Small style fixes.

6a96c4d

Issue #383.

[gcc] Clarify error message.

04f06c9

Issue #383.

[gcc] Skip tests if docker is not available.

eae86f9

Issue #383.

[ci] Install docker for coverage tests.

d377000

Issue #383.

[ci] Fix yaml file extension.

9a06c30

[gcc] Add GCC binary tests.

d6dd504

Issue #383.

ChrisCummins and others added 19 commits September 23, 2021 16:19

[bin] Add a --run_on_port flag to service.

6cfb8fc

This can be useful for debugging services: $ COMPILER_GYM_DEBUG=4 python -m compiler_gym.bin.service --env=llvm-v0 --run_on_port=8000 Issue #318.

[datasets] Make Benchmark.from_file() compatible with remote services.

7f0c25c

[docker] Add a docker image that contains a locally built CompilerGym.

430ed58

[docker] Add a docker image that installs compiler_gym from pypi.

ecb8d69

[Makefile] Tidy up build dir after docker build.

ee42fd5

Merge pull request #298 from bwasti/loop_tool

9e7dfac

[loop_tool] Add integration and tests

Un-bazel-ify examples.

42edac8

[ci] Run examples tests in-tree.

2db64da

[util] Add a --seed flag.

e0f3627

[util] FIx --benchmark flag when value is a file:/// URI.

3e8f888

[examples] Update the makefile integration example.

1a15d6c

[tests] Permit GCC env failure in examples test.

56e25bd

[env] Fix a bug in reset() failure handling.

8525319

[ci] Add a codeql workflow for Python.

2e3f4a0

Merge pull request #433 from facebookresearch/codeql

901022e

[ci] Add a codeql workflow for Python.

Merge pull request #424 from ChrisCummins/docker

60d9e8c

Add support for running CompilerGym environments from docker containers

Merge pull request #428 from ChrisCummins/examples-tests

408d09f

Un-bazel-ify the examples

Merge pull request #430 from ChrisCummins/env-reset-fix

8c66b24

[env] Fix a bug in reset() failure handling.

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 27, 2021

ChrisCummins marked this pull request as ready for review September 28, 2021 15:07

ChrisCummins added 4 commits September 29, 2021 12:04

[ci] Automatically build release artifacts on push to stable.

b0a075f

[Makefile] Split example tests into a separate target.

ed5ced8

[ci] Only run examples tests as post-build verification.

4b3d220

We will be running the full CI workflow on every push / PR, so repeating every test on the build artifacts seems wasteful. Instead just run the examples tests.

[git] Don't track in-tree .coverage files.

f511941

ChrisCummins force-pushed the release-v0.2.0 branch from 8cc95e2 to f511941 Compare September 29, 2021 12:27

ChrisCummins merged commit 9002afc into stable Sep 29, 2021

ChrisCummins deleted the release-v0.2.0 branch September 29, 2021 16:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CompilerGym Release v0.2.0 #434

CompilerGym Release v0.2.0 #434

ChrisCummins commented Sep 27, 2021 •

edited

Loading

codecov-commenter commented Sep 27, 2021 •

edited

Loading

CompilerGym Release v0.2.0 #434

CompilerGym Release v0.2.0 #434

Conversation

ChrisCummins commented Sep 27, 2021 • edited Loading

codecov-commenter commented Sep 27, 2021 • edited Loading

Codecov Report

ChrisCummins commented Sep 27, 2021 •

edited

Loading

codecov-commenter commented Sep 27, 2021 •

edited

Loading