Reddit as a Listening source

Updated 

By adding Reddit as a Listening Source, you can listen to and learn from Reddit data in real-time. 

Reddit is a massive collection of forums where people can share news and content or comment on other people’s posts. Sprinklr has access to the entire Reddit Firehose, a stream of user activities from Reddit. This means you can listen to all posts and comments from all public, active Subreddits.

Sprinklr repository has more than 125k active communities (not classified as over 18), and additionally, new Subreddits are added to the service immediately as they are created. 

Reddit source capabilities

  • Source: Reddit

  • Source of data: Official API - Firehose (All Reddit or Subreddit posts and engagement stats are available in real-time)

  • Coverage: Direct firehose of both public & privately owned accounts data

  • Latency: Near real-time: 1 to 3 mins

  • Historical data: Quarter 2 2017, Full historic access to owned account data dating back to the account creation date.

  • Backfill Support: Yes 

  • Engagement stats refresh rate: Due to Firehose – API the post is updated in real time, 5 to 8 minutes max, once it is fetched. Reddit scores are updated hourly once, for 7 days and Comment count for every 1 hour.

  • Author/Profile metadata details: Yes (Id, Name, AuthorKarma available)

    • Note: Profile Image is not available, hence age/ gender enrichments might be incorrect.

  • AI-enriched data: Sentiment, Emotions, Language, Word cloud, Topic Cluster, Age (profile), Gender (profile), Smart Themes Clustering, Smart Insights

  • Message type: Reddit Comment, Reddit Submission

  • Media type: Comment, Link, Wall post, Photo, Video, Status

Reddit data entities in Social Listening

Reddit specific data entities

Data Entity

Type

Description

Upvote Percentage

Metric

It shows the percentage of upvotes out of total votes for Reddit posts.

Note: Upvote Percentage is only available to Reddit posts, not for comments.

Reddit Author Karma

Metric

Reddit author karma reflects total votes a reddit user has earned on their posts and comments. When posts or comments get upvoted, that user gains some karma.

Reddit Score

Metric

Reddit Score is a net score that equals Upvotes minus Downvotes for a particular Reddit post or comment.

Web shares on reddit

Metric

Total shares of a web message (from sources like News, Blogs, etc.) on Reddit

Net vote

Metric

The sum of total upvotes minus total downvotes on a post.

Net Vote = Total Upvotes – Total Downvotes

Reddit supported engagement stats

Data Entity

Type

Description

Earned Engagement

Metric

Reddit Score + Comment count (reddit score = net vote)

Distinct Users

Metric

Distinct users count from which mentions are coming in.

Note: It is required to have an Author URL in the message metadata for it to be considered as a distinct user.

Comments Count

Metric

The total count of comments on a mention.

To fetch data from Reddit

  • In order to fetch data from Reddit, you need to create a Topic with Reddit as its source. To learn in detail about how to create a topic, refer to Create a Topic in Topic Creation UI.

  • While creating or editing your topic, click the Apply Filters tab, and select Reddit as one of the Web Sources.

  • You can also select Source: Reddit from the dashboard/widget filter.

Some interesting operator

  • parentMessageparentMessage: (Sachin OR Ronaldo OR Cricket OR Football) – this will fetch mentions whose parent message contains text keywords Sachin or Ronaldo or Cricket or Football.

To backfill data from Reddit

  • Once the topic is created/ updated, you can backfill the topic going far back to Quarter 2 of 2017.

  • To know more about how to backfill data, refer to Listening Backfill for Topics.

Limitations

  • Sender Profile link cannot be accessed due to API limitations

  • Sender Image URL does not work hence age, gender enrichment model does not work for reddit profiles, only works on the username which may not be accurate.

  • No support for differentiation of a "parent" comment v/s a "sub" or "threaded" comment/reply.

  • No location data is provided by API.

Frequently asked questions

No.

If Reddit or Subreddit posts are deleted from native, then they are also deleted from Sprinklr to comply with Reddit policies.

If we get the compliance update from API, then the modified Posts get updated in Sprinklr.

No, in username tokenization is such that it will fetch mentions only from @modi and not others. Also author: Tom / author: tom shows the same result (not case sensitive).

No, reach metric is not supported for Reddit as we do not get follower count due to API limitation.

Yes.

  • For Owned data – you can have data from the establishment of the account

  • For Earned data – backfilling is available from Q2, 2017

Yes, from Sprinklr you can engage with the user Reddit posts by double clicking on the conversation stream which further opens a panel for replying, assigning the Reddit posts to another user or assigning the Reddit posts to a particular macro.

Due to Firehose - API the post is updated in real time, 5 to 8 minutes max, once it is fetched. Reddit scores are updated hourly once, for 7 days and Comment count for every 1 hour.

Reddit pulls posts and comments based on keyword match only. If there is a match only in the comment, Listening will show the comment along with the parent post.

However, if the matched keyword is present in the parent post, then Listening will show all the comments as well in the results.

Yes, we can listen to Subreddits. Subreddits do not require any SV as we have API Firehose. To pull in data from Subreddits, use: url: "subreddit url".

For example, url: "reddit.com/r/auto".

Yes, Sprinklr can pull all posts + comments from an entire url - specific Subreddit.

Reddit has only one type of user account – and Subreddits are individual message boards/threads devoted to one particular topic. Since all Reddit users would post/engage on a particular Subreddit, our documentation captures this with the same terminology.