Joris Meys

@JorisMeys@mstdn.social

Wanted: open data suitable for a data science project!

Every year, we ask our students 2nd bach to do a project. We give them a (big) dataset (preferably in JSON or a bunch of files combined), ask some research question, and ask them to perform data cleaning and exploration related to that question using #Rstats.

I've used most obvious choices, so I'm turning to Fedi to find new, interesting datasets. If you have no idea, sharing helps too!

#dataScience #openData #Fedihelp #education

February 23, 2026 at 7:12:32 AM

You could use some animal tracking data, e.g. from movebank. We also have drone images of seals if you are interested.

there's also the LiveMouseTracker datasets, very detailed movement and interaction data of up to 4 lab mice in SQLite format.
See here for the project's website micecraft.org/lmt/,
and here for an example dataset: micecraft.org/lmt/download/201

(This project is cool for many other reasons, not least the ikea-style build instructions or the fact that they repurpose Kinect cameras for scientific work as high-res 3D video cameras).

micecraft.org

Live Mouser Tracker - MiceCraft

Thanks for the suggestion! I've seen movebank before, but forgot about it. Drone images of seals are absolutely cool, but that's going to be a bit much for an intro course data science I think :-)

The chess server LiChess

releases all chess games played on the server under cc0. 7,507,487,928 games so far, released in monthly batches.

database.lichess.org/

database.lichess.org

lichess.org open database

Thanks for sharing, that's a massive collection! Honestly a bit too massive for the task at hand actually, even though I know for a fact some of my students would love this. (they're playing chess during class, the little raskalls)

Elk Logo

Elk is in Preview!

Thanks for your interest in trying out Elk, our work-in-progress Mastodon web client!

Expect some bugs and missing features here and there. we are working hard on the development and improving it over time.

Elk is Open Source. If you'd like to help with testing, giving feedback, or contributing, reach out to us on GitHub and get involved.

To boost development, you can sponsor the Team through GitHub Sponsors. We hope you enjoy Elk!

Patak三咲智子 Kevin DengDaniel RoeAnthony Fu

The Elk Team