Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs:data distillation pipline for distilling high-quality maths reasoning data with thought process (Long Cot data)from deepseek R1 #1532

Merged
merged 3 commits into from
Feb 3, 2025

Conversation

zjrwtx
Copy link
Collaborator

@zjrwtx zjrwtx commented Jan 31, 2025

Description

Learn how to set up and leverage CAMEL's data distillation pipline for distilling high-quality maths reasoning data with thought process (Long Cot data)from deepseek R1, and uploading the results to Hugging Face.

  • I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds core functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)
  • Example (update in the folder of example)

Implemented Tasks

  • Subtask 1
  • Subtask 2
  • Subtask 3

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide. (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly. (required for a bug fix or a new feature)
  • I have updated the documentation accordingly.

@zjrwtx zjrwtx requested a review from Wendong-Fan January 31, 2025 18:47
@zjrwtx zjrwtx self-assigned this Jan 31, 2025
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@zjrwtx zjrwtx added Data Related to camel data processing cookbook labels Jan 31, 2025
@Wendong-Fan Wendong-Fan added this to the Sprint 22 milestone Feb 3, 2025
@Wendong-Fan Wendong-Fan merged commit c572a5c into master Feb 3, 2025
5 of 6 checks passed
@Wendong-Fan Wendong-Fan deleted the distillation_cookbook branch February 3, 2025 14:55
apokryphosx pushed a commit that referenced this pull request Feb 11, 2025
…oning data with thought process (Long Cot data)from deepseek R1 (#1532)

Co-authored-by: “yifeng.wang” <“3038880699@qq.com;q:wqqgit config --global user.name “yifeng.wang”git config --global user.email “3038880699@qq.com>
Co-authored-by: Wendong <w3ndong.fan@gmail.com>
Co-authored-by: Wendong-Fan <133094783+Wendong-Fan@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cookbook Data Related to camel data processing
Projects
Status: No status
Development

Successfully merging this pull request may close these issues.

2 participants