Social Media Datasets for Programmatic SEO

Last updated:

Looking to create a programmatic SEO website in the social media niche?

You have come to the right place, then. We have collected the 10 best social media datasets for programmatic SEO (most of them are free) that you can download/access for your projects.

Let’s take a look…

10 useful social media datasets for pSEO

Along with a brief description of all the datasets, we have also included the format(s) they are available in.

1. Social Media Influencers

Available format(s): CSV

A dataset containing top 1000 social media influencers from Instagram, YouTube, and TikTok, each with their number of followers and other relevant information.

2. TwineSocial

Available format(s): JSON

TwineSocial allows you to find and access content from multiple social media networks, like Twitter, Instagram, Facebook, Vine, Tumblr, Flickr, and Google+ by using hashtags, account handles, and geo-location. It offers a high-performance, scalable interface with server-side rules and a moderation feature.

3. Emoji Dictionary with R Encodings and Image Files

Available format(s): XLS

A dataset of emojis from Unicode 10.0 with R encodings, Unicode categories, subcategories, and Emojipedia names along with corresponding image files. 2,624 rows in total.

4. Instagram

Available format(s): JSON

The instagram dataset contains basic metadata from Instagram user, hashtag, location feed pages, comments, and people who liked specific posts, followers, and followings from a username.

5. Influencer Search

Available format(s): JSON

A dataset that provides information on influencers through the Social Animal Influencer Search API, including data on twitter profiles, top authors, and best sharers of content for a specific query, with options to sort by followers, number of tweets, location and type of influencer.

6. Usage of social media by students between age 17-22

Available format(s): XLS

A dataset of students between the ages of 17-22, including their age, preferred social media platforms, daily usage time, physical activity time, and perception of exposure to inappropriate content on those platforms. Timestamp is included.

7. Social Networks Global Coverage – Account, Business & Non-business

Available format(s): CSV, JSON, XLS

A dataset of 514 million records of social media accounts from 249 countries with various data points such as followers, profile type, engagement score, location, external links and more. Can be filtered by geography, account type, brand affiliation, hashtags and more.

8. LinkedIn data for 24Million companies

Available format(s): JSON

A dataset of 24 million companies, including company name, country, size, headquarters, website, followers, industry, employees, employees on LinkedIn, about, and founded information.

9. Tagdef

Available format(s): JSON

Tagdef dataset is a large hashtag dictionary containing over 60,000 user-generated definitions for hashtags commonly used on Twitter, Pinterest, and Google+.

10. Twitter Celebrity Tweets And Embeddings

Available format(s): CSV

The Twitter Celebrity Tweets And Embeddings dataset contains tweets and embeddings of top 1000 celebrity Twitter accounts.

That’s it.

All the best for your pSEO projects in the social media niche.


Programmatic SEO OS

A comprehensive operating system for your programmatic SEO projects that helps you master the craft and save 100+ hours.

  • Text + video tutorials
  • 100+ useful datasets
  • 50+ pSEO examples
  • 60+ programmatic SEO tools
  • 30+ case studies
  • Cool people to follow