Skip to main content
Show Me The Data
HomeIntro
About

Newsletter

Get insights on platform data and research

Subscribe

YouTube Channel

Video tutorials and insights

Subscribe

Support on Patreon

Help create more content

Become a Patron

Buy Me a Coffee

One-time support

Buy Coffee

Created by Matt Motyl

© 2025 Matt Motyl. All rights reserved.

On This Page

Submit Feedback

Introduction to Platform Data

Introduction

Very large online platforms and search engines (VLOPSEs) collect both an incredible amount and a wide variety of data that even newly hired employees can struggle with as they learn the internal data systems. The structure of the data and the data warehouse that the companies use to store all their data can vary from company to company. Some might be chaotic, with data stored in tables that employees organically produce over time, and some might be more standardized and structured with employees required to follow a strict framework. In either situation, it can still be challenging to understand the potentially hundreds or thousands of tables that VLOPSEs maintain in their data warehouses.

Given the role of online platforms in our lives, it is not surprising that civil society and research institutions recognize the importance of these data, and in particular how it can help us better understand the risks that online platforms can pose to people, societies, and democracies. This data can also tell us how effectively the company is managing and minimizing those risks. Researchers and civil societies need to have access to datasets, both historical and real time, for studying the scale, cause, and nature of risks from the platforms. They need to be able to monitor the information environment in real time, especially around critical societal events like elections.

Article 40 of the Digital Services Act (DSA) allows for significant researcher access to VLOPSE data, creating the opportunity to answer research questions surrounding the systemic risks outlined in the DSA. This paper aims to equip vetted academic and civil society researchers with the understanding and tools necessary to best utilize their access to the data for the public good.

The Digital Services Act (DSA)

The DSA is landmark European legislation that imposes new obligations on digital platforms, particularly Very Large Online Platforms (VLOPs) and Very Large Online Search Engines (VLOSEs) - defined as services with more than 45 million monthly active users in the EU.

Key provisions include:

  • Transparency reporting requirements
  • Data access for researchers
  • Content moderation
  • Recommendation system disclosures
  • Risk assessments and mitigation measures

About VLOPs and VLOSEs

Very Large Online Platforms (VLOPs)

Online platforms with more than 45 million monthly active users in the EU. These include social media platforms, marketplaces, and app stores that must comply with enhanced DSA obligations.

Very Large Online Search Engines (VLOSEs)

Search engines with more than 45 million monthly active users in the EU. Currently includes Google Search and Bing, subject to similar transparency requirements as VLOPs.

All Designated Platforms

The table below shows all 25 platforms designated as VLOPs or VLOSEs. Click column headers to sort.

Complete list of VLOPs and VLOSEs (as of October 2025)

PlatformTypeEU Users (Est.)API AvailableResearch Access
FacebookVLOP255M+Graph API Yes
InstagramVLOP250M+Graph API Limited
TikTokVLOP150M+Research API Application Required
YouTubeVLOP400M+Data API v3 Yes
X (Twitter)VLOP100M+X API v2 Paid Tiers
LinkedInVLOP180M+LinkedIn API Limited
SnapchatVLOP100M+Marketing API No
PinterestVLOP130M+Pinterest API Limited
WikipediaVLOP350M+MediaWiki API Yes (Open)
Amazon StoreVLOP200M+Product Advertising API Limited
AliExpressVLOP150M+Affiliate API Limited
Google PlayVLOP300M+Developer API No
Apple AppStoreVLOP200M+App Store Connect API No
Booking.comVLOP80M+Affiliate API No
Google MapsVLOP400M+Maps Platform API Limited
Google ShoppingVLOP300M+Content APILimited
ZalandoVLOP50M+Partner API No
SheinVLOP100M+Affiliate APINo
TemuVLOP92M+LimitedNo
PornhubVLOP130M+LimitedNo
StripchatVLOP75M+NoNo
XVideosVLOP165M+NoNo
XNXXVLOP150M+NoNo
Google SearchVLOSE400M+Custom Search API Limited
BingVLOSE100M+Bing Search API Limited

Click column headers to sort. 25 total rows.

Platform Categories

Social Media

  • Facebook
  • Instagram
  • TikTok
  • YouTube
  • X (Twitter)
  • LinkedIn
  • Snapchat
  • Pinterest

Marketplaces & Services

  • Amazon Store
  • AliExpress
  • Booking.com
  • Zalando
  • Shein
  • Temu

Search & Discovery

  • Google Search (VLOSE)
  • Bing (VLOSE)
  • Google Maps
  • Google Shopping
  • Google Play
  • Apple AppStore
  • Wikipedia

Tip: Throughout this guide, terms with dotted underlines have tooltip definitions. Hover over them to see a quick definition, or click "Learn more" to come here for the full explanation.

Platform-Specific Guides Coming Soon (pending support)

Detailed guides for each platform will include:

  • Data access methods and authentication requirements
  • Available datasets and data structures
  • API documentation and code examples
  • Research access programs and application processes
  • Best practices for working with platform-specific data
  • Known limitations and considerations
Back to HomeNext: Data Types