OpenDataology
Sandbox
OpenDataology is an open source dataset license compliance analysis project. Our project enables users of publicly available datasets and users who curate a dataset from multiple data sources (particularly for use as a part of machine learning models) identify the potential license compliance risks. Our project is primarily comprised of three key components.
- A dataset license compliance analysis workflow that ascertains the final allowed rights and the required obligations associated with using a publicly avialable dataset or a dataset that is curated from multiple data sources for any purpose.
- A growing database and a web portal that documents the final rights and obligations (after the license compliance analysis is conducted) associated with the datasets and the data sources analyzed in our project. The database also documents the metadata collected and used to conduct the compliance workflow
- An online license generation toolkit that creators of dataset to generate custom licenses depending on the exact rights and obligations that they want to allow (instead of having to rely of existing available and limited dataset specific licenses)
Publicly available datasets are at the heart of Open AI and machine learning software and models. Using the publicly available datsest compliantly will be one of the key aspects in enabling LF-AI's mission to build and support an open artificial intelligence (AI) and data community. This project will always remain open, transparent and accessible to both users and contributers alike.
We have identified collaboration opportunities with the following LF-AI projects
- OpenBytes
- OpenLineage
- OpenDS4All
In addition we also currently collaborate with LF product SPDX.
MIT License, https://opensource.org/licenses/MIT
The URL to location of the source code is (github): https://github.com/OpenDataology/OpenDataology
Yes - https://github.com/OpenDataology
*Do you have the GH DCO app active in the repos?
Yes
We use the GitHub Issue tracker. Our issue repo can be found at: https://github.com/OpenDataology/OpenDataology/issues
- We have a dedicated slack channel at: https://join.slack.com/t/dataset-license/shared_invite/zt-1823jgzvb-3ExLy22G4fKSaTYdXb9fYQ
- We also plan to have biweekly meeting to synchronize on the collaborations.
There are no external dependencies in our project.
- Gopi Krishnan Rajbahadur, gopikrishnanrajbahadur@gmail.com, Huawei Canada, 12 months
- Clement Li, lizi4@huawei.com, Huawei China, 12 months
- Zicheng Qu, quzicheng315@gmail.com, Huawei China, 5 months
- Daniel M German, dmg@uvic.ca, University of Victoria Canada, 12 months
- Jack jiang,zmjiang@gmail.com, York University Canada, 12 months
- Dora HU, hujing@grandall.com.cn, Grandall China, 2 months
- Zhonghua Zhu, 625945373@qq.com, Meituan China, 2 months
- Song Liu, claimonx@gmail.com, Fuzhou University China, 6 months
- Zhengcai You, youzhengcai@gmail.com, Fuzhou University China, 6 months
- Gopi Krishnan Rajbahadur, gopikrishnanrajbahadur@gmail.com, Huawei Canada, 12 months
- Clement Li, lizi4@huawei.com, Huawei China, 12 months
- Zicheng Qu, quzicheng315@gmail.com, Huawei China, 5 months
- Dora HU, hujing@grandall.com.cn, Grandall China, 2 months
- Daniel M German, dmg@uvic.ca, University of Victoria Canada, 12 months
- Jack jiang,zmjiang@gmail.com, York University Canada, 12 months
- Zhonghua Zhu, 625945373@qq.com, Meituan China, 2 months
- Song Liu, claimonx@gmail.com, Fuzhou University China, 6 months
- Zhengcai You, youzhengcai@gmail.com, Fuzhou University China, 6 months
- Boyuan Chen, chenfsd@gmail.com, Huawei Canada, 6 months
- Zhipeng Huang, zhipengh512@gmail.com, Huawei China, 8 months
- Dayi Lin,heylindayi@gmail.com, Huawei Canada, 8 months
Have the project defined the roles of contributor, committer, maintainer, etc.? Please document it in MAINTAINERS.md.
- Our roles of contributers, committers and maintainers have been defined in our governance structure which can be found at: https://github.com/OpenDataology/OpenDataology/blob/main/GOVERNANCE.md
Our contributers, committers and maintainers can be found here: https://github.com/OpenDataology/OpenDataology/blob/main/CONTRIBUTORS.md
Total number of contributors to the project including their affiliations at the time of submitting this proposal:
We have 8 contributors to the project. Our contributers come from Huawei Canada, Huawei China, Grandall China, York University Canada and University of Victoria Canada
No
Does the project have a code of conduct? If yes, please share the URL. If please created CODE_OF_CONDUCT.md and point to https://lfprojects.org/policies/code-of-conduct/. You can use conduct@lfai.foundation as email for contact on this topic.
Our code of conduct can be found here: https://github.com/OpenDataology/OpenDataology/blob/main/CODE_OF_CONDUCT.md
Confluence wiki
Project website - Do you have a web site? If no, did you reserve a and would like you to have a website created?
No, we have researved a domain name (OpenDataology.com). However, we haven't created a website yet. We will create the website in the near future.
https://github.com/OpenDataology/OpenDataology
Project governance - Do you have a working governance model for project? Please provide URL to where it is documented, typically GOVERNANCE.md.
Yes. Our governance model can be found here: https://github.com/OpenDataology/OpenDataology/blob/main/GOVERNANCE.md
• Social media accounts - Do you have any Twitter/LinkedIn/Facebook/etc. project accounts? Please provide pointers.
Twitter: @OpenDataology
Existing sponsorship (e.g., whether any organization has provided funding or other support to date, and a description of that support), if any.
The project has not received any external funding.