Skip to main content
Show Me The Data
HomeIntro
About

Newsletter

Get insights on platform data and research

Subscribe

YouTube Channel

Video tutorials and insights

Subscribe

Support on Patreon

Help create more content

Become a Patron

Buy Me a Coffee

One-time support

Buy Coffee

Created by Matt Motyl

© 2025 Matt Motyl. All rights reserved.

Have you ever wanted to understand and work with tech data?

Look no further!

Welcome to my comprehensive and interactive guide to understanding how social media platforms and search engines collect, process, use, and store data. Master the skills needed to work with platform data through interactive tools and real-world examples.

Get StartedTry SQL ToolTry API Demo

About This Guide

Matt Motyl

My name is Matt Motyl, and I'm a technologist and behavioral data scientist who worked in a data intensive role for one of the very large online platforms. When I was a professor teaching statistics, I knew that many students came into those courses dreading them. As someone who loves statistics and data (probably too much), I always tried to strip out the needlessly complicated jargon and symbols, and help students to see how data and stats are actually useful and not anywhere near as scary as they first thought. When I moved into industry, I found the same thing to be true: there is a lot of unnecessary jargon and confusing-looking things to deal with just to access the data. So, I wanted to create a resource that would help demystify platform data for those outside these companies.

One of the projects I led as a Resident Fellow at the Integrity Institute for the European Digital Media Observatory (EDMO) was writing a brief handbook on how online platforms collect and structure data, and how people outside of these companies can access and use data from these companies. Tha handbook was well-received, but it was also written like a handbook -- not as an interactive guide with exercises to help readers practice what the handbook was trying to teach them to do.

Therefore, I built this interactive web version to enhance understanding by providing more explanations, practical examples, and hands-on tools to help you master obtaining, analyzing, and using platform data. My hope is to make this an evergreen resource that promotes data literacy and is as helpful to as many people as possible. I am not being paid for my work on this site or for the infrastructure costs of maintaining it, but I will try to add new content, features, and keep it up to date as best I can. If you have questions on anything on this website, want to flag anything that is out-of-date (e.g., platform API changes), or have suggestions for new content or features, please let me know on this Google form. If you'd like to discuss more personalized assistance, consulting, or training, please email me at matt.motyl@gmail.com.

While the original handbook was part of a contract between the Integrity Insitute and EDMO, and was reviewed by Jeff Allen, a co-founder and the Chief Research Officer, Spencer Gurley, the Research Lead, and Sofia Bonilla, the Strategy & Partnerships Lead at the Integrity Institute, and by representatives from EDMO, this website has not been reviewed or approved of in any formal way by those organizations. The website generally follows from the handbook, but any divergences should be interpreted as my own perspective and not necessarily that of the Integrity Institute or EDMO.

Goals of This Guide

Specifically, this guide aims to:

1.Describe what types of data VLOPSEs collect
2.Describe how these data are stored
3.Describe how these data are (or can be) used
4.Provide a basic introduction to SQL with practice activities
5.'Provide a basic introduction to APIs with an interactive demonstration
6.Share a list of platform data that are already publicly available
7.Provide examples of how platform data can be used to answer questions
8.Review data access APIs from VLOPSEs

Who This Guide Is For

🎓

Researchers & Academics

Learn how to access and analyze platform data for studies, publications, and understanding online information ecosystems.

🏛️

Regulators & Policy Makers

Monitor platform compliance, assess risks, and develop evidence-based policies using platform data.

🤝

Civil Society Organizations

Investigate platform impacts on communities, hold platforms accountable, and advocate for change.

💡

Think Tanks & Policy Experts

Develop evidence-based policy recommendations using platform data and research insights.

📰

Journalists & Reporters

Use platform data to investigate stories, verify claims, and report on digital platform practices.

📊

Data Analysts

Work with complex platform datasets using SQL, APIs, and data visualization techniques.

How to Use This Guide

This guide is structured to take you from basic concepts to practical application:

  1. Start with the introduction to understand the landscape
  2. Learn about different data types and how they're structured
  3. Understand how data is mapped to predict outcomes
  4. Practice SQL queries with our interactive playground
  5. Learn to connect to and use platform APIs
  6. Explore common pitfalls and other data resources

What You'll Learn

Platform Data Fundamentals

  • ✓ How platforms collect and structure data
  • ✓ Understanding data schemas and relationships
  • ✓ Privacy and ethical considerations
  • ✓ DSA compliance and transparency requirements

Technical Skills

  • ✓ Writing SQL queries for data analysis
  • ✓ Connecting to platform APIs
  • ✓ Data extraction and transformation
  • ✓ Best practices for data handling

Platform-Specific Guides

  • ✓ Data availability for each VLOP/VLOSE
  • ✓ Platform-specific API documentation
  • ✓ Common data structures and formats
  • ✓ Real-world use cases and examples

Interactive Learning

  • ✓ Practice SQL with synthetic data
  • ✓ Live API connection demos
  • ✓ Sortable data tables and examples
  • ✓ Hands-on exercises and challenges

Ready to Get Started?

Begin your journey to understanding and working with platform data

Start Learning Now