Data Contribution Terms
Thank you for contributing to the Oopsie Data effort. Before registering your laboratory as a contributor, please read the following terms carefully. By submitting a registration form, the registering representative confirms that they have read, understood, and agreed to these terms on behalf of their laboratory.
1. Laboratory Representative Responsibility
The person completing the registration form acts as the designated representative for their laboratory. By registering, this representative:
- Confirms they have authority to agree to these terms on behalf of their laboratory and institution.
- Accepts responsibility for ensuring that all local contributors within their laboratory are informed of and agree to the licensing and privacy terms described below before submitting any data.
- Agrees to provide an accurate list of all local contributors and their ORCID iDs to the project team. This list will be included in dataset release metadata to ensure proper attribution.
- Agrees to notify the project team of any changes to their contributor list or contact information.
2. Data Licensing
All data contributed to this project is released to the public under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This means that anyone may freely use, share, and adapt the contributed data, provided that appropriate credit is given to the contributing laboratory. By submitting data, contributors agree to this license for all episodes submitted under their laboratory identifier.
If your institution has restrictions on open data release, please consult your research office before registering.
3. Privacy and Personally Identifiable Information (PII)
3.1 Contributor Responsibilities
Contributors are responsible for reviewing their data before submission. You should not submit data in which:
- Individuals have a reasonable expectation of privacy.
- Privacy protection is a strict legal or institutional requirement.
- Sensitive or confidential information about people, facilities, or equipment is present.
While our curation pipeline includes a step to detect and blur faces and other identifiable information, this processing is conducted on a best-effort basis. It does not constitute a guarantee of complete anonymization, and the project team cannot be held responsible for residual PII that is not detected or removed.
3.2 Project Team Processing
As part of the biweekly curation process, the project team will:
- Review video data for faces and identifying signage.
- Apply automated and manual blurring where PII is detected.
3.3 Acknowledgment
By submitting data, contributors acknowledge that:
- Residual identifiable information may remain in publicly released data despite best-effort processing.
- The project team will not be held liable for PII that is not detected during curation.
- Contributors bear primary responsibility for ensuring their submissions are appropriate for open release.
3.4 Pseudonomyzation
We collect names of robot operators and annotators. This is relevant information to establish the provenance of the data. However, if a contributor does not want to have their name published alongside the data, we are happy to accept pseudonyms. Please ensure that the pseudonym is consistent across all collected episodes and annotations.
4. Raw Data Archive
A restricted-access archive of unprocessed submitted data will be administered by the University of Texas at Austin. This archive is not publicly accessible and is available only to researchers who have agreed to a Data Use Agreement (DUA). The existence and scope of this archive is documented in the public dataset metadata.
5. Attribution and Credit
Contributing laboratories will be credited in every public dataset release that includes their data. Credit is provided through:
- Laboratory name and affiliation in the dataset metadata.
- ORCID iDs of individual contributors listed in the release notes.
- A dedicated contributors page on this website.
Labs which contribute a significant amount of data will be invited to become co-authors on an eventual publication of this dataset at a conference or journal.
6. Data Withdrawal
If you wish to withdraw previously submitted data, please contact the project team at oopsie-team@googlegroups.com. Withdrawal requests will be processed in the next scheduled curation cycle. Note that data already included in a published, versioned dataset release cannot be retroactively removed from that version, but will be excluded from all future releases.
7. Questions
If you have any questions about these terms, please open an issue on our GitHub repository or contact the project team directly at oopsie-team@googlegroups.com.
These terms were last updated: 2026-04-03. They may be revised as the project develops; registered contributors will be notified of any material changes.