Wanted: open data suitable for a data science project!

Every year, we ask our second-year bachelor students to do a project. We give them a (big) dataset (preferably in JSON or as a bunch of files to combine), pose a research question, and ask them to perform data cleaning and exploration related to that question using #Rstats.

I've used most of the obvious choices, so I'm turning to Fedi to find new, interesting datasets. If you don't have any ideas, sharing helps too!

#dataScience #openData #Fedihelp #education

Tillman Reuter

@tillmanreuter@ecoevo.social

Use #GBIF species occurrence point data. Also spatial, and combinable: geodaten.bayern.de

February 23, 2026 at 10:16:35 AM

Thanks, I hadn't seen that one before. Great source for species occurrence, totally agree. Could be nice in combination with climate data or something else to map species occurrence against. If you have suggestions that are doable for an intro-level data science course, I would love to hear them!

Sure! It can be combined nicely with bioclim (load through R) or ERA5 (download as reanalysis data from the web).

Bioclim is a great idea, thx!

We also have a wealth of training resources and structured learning modules that may be helpful for your task. 🙂 GBIF provides access to more than 3.6 billion open access occurrence records, so if you're after big data, you've come to the right place! 🌱 gbif.org/training
