Commit f43645d

Update Readme.md
1 parent 2cc0238 commit f43645d

1 file changed

+6
-6
lines changed

Readme.md

+6-6
@@ -161,20 +161,20 @@ In summary, our method systematically integrates advanced reasoning paradigms, d
 </div>
 
 # Dataset
-[**OpenManusRL-Dataset**](https://huggingface.co/datasets/CharlieDreemur/OpenManus-RL) combines agent trajectories from [AgentInstruct](https://huggingface.co/datasets/THUDM/AgentInstruct) and [Agent-FLAN](https://huggingface.co/datasets/internlm/Agent-FLAN) with features:
+[**OpenManusRL-Dataset**](https://huggingface.co/datasets/CharlieDreemur/OpenManus-RL) combines agent trajectories from [AgentInstruct](https://huggingface.co/datasets/THUDM/AgentInstruct), [Agent-FLAN](https://huggingface.co/datasets/internlm/Agent-FLAN) and [AgentTraj-L(AgentGym)] with features:
 
 - 🔍 **ReAct Framework** - <a href="https://react-lm.github.io/" target="_blank">Reasoning-Acting integration</a>
 - 🧠 **Structured Training** - Separate format/reasoning learning
 - 🚫 **Anti-Hallucination** - Negative samples + environment grounding
 - 🌐 **6 Domains** - OS, DB, Web, KG, Household, E-commerce
 
-## Dataset Overview
-
+### Dataset Composition
 | Source | Trajectories | Avg Turns | Key Features |
 |--------|--------------|-----------|--------------|
-| [AgentInstruct](https://huggingface.co/datasets/THUDM/AgentInstruct) | 1,866 | 5.24 | Multi-task QA, CoT reasoning |
-| [Agent-FLAN](https://huggingface.co/datasets/internlm/Agent-FLAN) | 34,442 | 3-35 | Error recovery patterns, diverse real-world tasks |
-| **Combined** | 36,308 | 4-20 | Enhanced generalization, broader task coverage |
+| [AgentInstruct](https://huggingface.co/datasets/THUDM/AgentInstruct) | 1,866 | 5.24 | Multi-task QA, CoT reasoning |
+| [Agent-FLAN](https://huggingface.co/datasets/internlm/Agent-FLAN) | 34442 | 3-35 | Error recovery patterns, diverse real-world tasks|
+[AgentTraj-L](https://huggingface.co/datasets/AgentGym/AgentTraj-L) | 14485 | 3-35 | interactive environments and tasks
+| **Combined** | 50793 | 4-20 | Enhanced generalization, uniform format with broader task coverage |
 
 ### Supported Tasks
 - **text-generation**: ReAct-style instruction following
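
The **Combined** trajectory count in the Dataset Composition table changes from 36,308 to 50793 once AgentTraj-L is added. A minimal sketch for checking that arithmetic follows. The counts are copied from the table in this diff, while the `sources` dictionary and the `combined_total` helper are illustrative names that are not part of the repository.

```python
# Trajectory counts copied from the "Dataset Composition" table in this diff.
# The dictionary and helper below are hypothetical, for illustration only.
sources = {
    "AgentInstruct": 1866,
    "Agent-FLAN": 34442,
    "AgentTraj-L": 14485,
}


def combined_total(counts):
    """Sum the per-source trajectory counts."""
    return sum(counts.values())


# Total before this commit (AgentTraj-L not yet included).
old_total = combined_total(
    {name: n for name, n in sources.items() if name != "AgentTraj-L"}
)
# Total after this commit (all three sources).
new_total = combined_total(sources)

print(old_total, new_total)
```

Both totals agree with the **Combined** rows on the removed and added sides of the diff.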
