Share Datasets of News Broadcasts

#14
by constantinSch - opened
Journalists on Hugging Face org

Hello everyone,

I have a question about academic data sets containing news media content. Let's assume that a public broadcaster wants to make some of its video news broadcasts accessible to researchers on HF. This would mainly involve providing permanent links to the videos on their distribution platform, along with some internal metadata (e.g. abstracts, subtitles, etc.). For legal reasons, these could only be shared for non-commercial use.

Do you know of any published datasets like this or papers that you could point me towards? I am looking especially for legal wording and the kind of licenses used. I am particularly interested in datasets shared by public service broadcasters, and even more so by members of the European Broadcasting Union. I came across some related examples on HF, but they're not quite what I am looking for:

I am grateful for all suggestions. Also, if you work for a public service broadcaster and have experience sharing archive data according to the FAIR principles, I would love to hear from you.

Kind regards,
Constantin

Sign up or log in to comment