September 13-16, 2022
Dublin, Ireland + Virtual
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit Europe 2022 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Irish Standard Time (UTC +1). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Back To Schedule
Friday, September 16 • 10:50 - 11:30
OpenDataology - An Open Source Dataset License Compliance Project Initiative - Gopi Krishnan Rajbahadur, Huawei Technologies Canada

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Publicly available datasets are one of the key drivers for commercial AI software. The use of these datasets are governed by dataset licenses. Dataset licenses outline the rights one is entitled to on a given dataset and the obligations that one must fulfil to enjoy such rights. Failure to comply with the license terms might result in serious legal issues. However, unlike standardized Open Source Software (OSS) licenses, existing dataset licenses do not clearly outline the rights and obligations associated with their usage. Also, datasets can be created from multiple data sources each of which may have different licenses which further compounds the issue. Hence, existing approaches on checking OSS license compliance cannot be used. In this talk, we introduce our project OpenDataology, an open source initiative that proposes a new approach to assessing the potential license compliance violations associated with a dataset. We elaborate how our OpenDataology project can be a crowd-sourced medium that enables identifying and documenting the license compliance risks associated with using publicly available datasets for AI software. We demonstrate how OpenDataology identified potential risks of license violations associated with 6 commonly used publicly available datasets in the wild.


Gopi Krishnan Rajbahadur

Senior Researcher, Huawei
Gopi Krishnan Rajbahadur is a Senior Researcher at the Centre for Software Excellence at Huawei, Canada. He holds a PhD in computer science from Queen's University, Canada. He received his BE in Computer Science and Engineering from SKR Engineering College, Anna University, India... Read More →

Friday September 16, 2022 10:50 - 11:30 IST
Wicklow Meeting Room 2 (Level 2)