Skip to content

Commit f86101c

Browse files
authored
Merge branch 'main' into feat/integrate_camel
2 parents c764a8c + ef861c7 commit f86101c

File tree

4 files changed

+36
-9
lines changed

4 files changed

+36
-9
lines changed

README.md

+14-3
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,17 @@
66
[![Wechat][wechat-image]][wechat-url]
77
[![Twitter][twitter-image]][twitter-url]
88

9+
<p align="center">
10+
<a href="https://crab.camel-ai.org/">Website & Demos</a> |
11+
<a href="https://www.camel-ai.org/post/crab">Blog</a> |
12+
<a href="https://dandansamax.github.io/posts/crab-paper/">Chinese Blog</a> |
13+
<a href="https://www.camel-ai.org/">CAMEL-AI</a>
14+
</p>
15+
16+
<p align="center">
17+
<img src='https://raw.githubusercontent.com/camel-ai/crab/main/assets/CRAB_logo1.png' width=800>
18+
</p>
19+
920
## Overview
1021

1122
CRAB is a framework for building LLM agent benchmark environments in a Python-centric way.
@@ -18,10 +29,10 @@ CRAB is a framework for building LLM agent benchmark environments in a Python-ce
1829

1930
⚙ ️Easy-to-use Configuration
2031
* Add a new action by simply adding a `@action` decorator on a Python function.
21-
* Deine the environment by integrating several actions together.
32+
* Define the environment by integrating several actions together.
2233

2334
📐 Novel Benchmarking Suite
24-
* Define tasks and the corresponding evlauators in an intuitive Python-native way.
35+
* Define tasks and the corresponding evaluators in an intuitive Python-native way.
2536
* Introduce a novel graph evaluator method providing fine-grained metrics.
2637

2738
## Installation
@@ -72,4 +83,4 @@ Please cite [our paper](https://arxiv.org/abs/2407.01511) if you use anything re
7283
[twitter-url]: https://twitter.com/CamelAIOrg
7384
[twitter-image]: https://img.shields.io/twitter/follow/CamelAIOrg?style=social&color=brightgreen&logo=twitter
7485
[arxiv-image]: https://img.shields.io/badge/arXiv-2407.01511-b31b1b.svg
75-
[arxiv-url]: https://arxiv.org/abs/2407.01511
86+
[arxiv-url]: https://arxiv.org/abs/2407.01511

assets/CRAB_logo1.png

30.3 KB
Loading

crab-benchmark-v0/docs/environment_local_setup.md

+21-5
Original file line numberDiff line numberDiff line change
@@ -30,18 +30,34 @@ chmod +x ubuntu_env_init.sh
3030

3131
The VM will reboot after initilization. After rebooting, remember its ip address.
3232

33+
34+
## Install ADB
35+
36+
Download and install ADB from its [official website](https://developer.android.com/tools/releases/platform-tools).
37+
3338
## Install Android Emulator
3439

35-
Download the newest version of [Android Studio](https://developer.android.com/studio). Install it.
40+
You can use emulators in [Android Studio](https://developer.android.com/studio) to simulate an Android device if you
41+
don't want to use a physical one.
3642

37-
Open Android studio and use build-in device manager to create a Pixel 8 Pro with system image release "R".
43+
To create a new virtual device, open Android Studio and use its built-in device manager to create a Pixel 8 Pro with
44+
system image release "R".
45+
46+
> Note that the benchmark on our side runs on a Google Pixel 8 Pro with system image release "R". However, cases are
47+
> noticed that Google API Level 30 may not work properly when trying to enable USB debugging mode. If such issues are
48+
> encountered, you can try switch to releases of lower API levels (e.g. "Q").
3849
3950
![](./assets/android_1.png)
4051

4152
![](./assets/android_2.png)
4253

43-
Then boot it.
54+
Then you can boot the device. To check if it's all set, run
4455

45-
## Install ADB
56+
```shell
57+
adb devices
58+
```
59+
60+
You should see the device in the list.
4661

47-
Download and install ADB from its [official website](https://developer.android.com/tools/releases/platform-tools)
62+
> Important: ADB won't work normally if you see an `unauthorized` tag after the device ID. To solve this, enable both
63+
> the developer mode and USB debugging mode in the device.

crab/core/experiment.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -105,7 +105,7 @@ def init_log_dir(self):
105105
self.task_info_dir.mkdir(exist_ok=True, parents=True)
106106
self.write_task_info_json(self.task_info_dir / "task_info.json")
107107

108-
self.time_now = datetime.now().strftime("%Y-%m-%d_%H:%M:%S")
108+
self.time_now = datetime.now().strftime("%Y-%m-%d_%H-%M-%S")
109109
self.current_experiment_dir = (
110110
self.task_info_dir / f"{self.agent_policy.__class__.__name__}"
111111
f"({self.agent_policy.get_backend_model_name()})" / self.time_now

0 commit comments

Comments
 (0)