Reddit as a Listening source
Updated
By adding Reddit as a Listening Source, you can listen to and learn from Reddit data in real-time.
Reddit is a massive collection of forums where people can share news and content or comment on other people’s posts. Sprinklr has access to the entire Reddit Firehose, a stream of user activities from Reddit. This means you can listen to all posts and comments from all public, active Subreddits.
Sprinklr repository has more than 125k active communities (not classified as over 18), and additionally, new Subreddits are added to the service immediately as they are created.
Reddit source capabilities
Source: Reddit
Source of data: Official API - Firehose (All Reddit or Subreddit posts and engagement stats are available in real-time)
Coverage: Direct firehose of both public & privately owned accounts data
Latency: Near real-time: 1 to 3 mins
Historical data: Quarter 2 2017, Full historic access to owned account data dating back to the account creation date.
Backfill Support: Yes
Engagement stats refresh rate: Due to Firehose – API the post is updated in real time, 5 to 8 minutes max, once it is fetched. Reddit scores are updated hourly once, for 7 days and Comment count for every 1 hour.
Author/Profile metadata details: Yes (Id, Name, AuthorKarma available)
Note: Profile Image is not available, hence age/ gender enrichments might be incorrect.
AI-enriched data: Sentiment, Emotions, Language, Word cloud, Topic Cluster, Age (profile), Gender (profile), Smart Themes Clustering, Smart Insights
Message type: Reddit Comment, Reddit Submission
Media type: Comment, Link, Wall post, Photo, Video, Status
Reddit data entities in Social Listening
Reddit specific data entities
Data Entity | Type | Description |
Upvote Percentage | Metric | It shows the percentage of upvotes out of total votes for Reddit posts. Note: Upvote Percentage is only available to Reddit posts, not for comments. |
Reddit Author Karma | Metric | Reddit author karma reflects total votes a reddit user has earned on their posts and comments. When posts or comments get upvoted, that user gains some karma. |
Reddit Score | Metric | Reddit Score is a net score that equals Upvotes minus Downvotes for a particular Reddit post or comment. |
Web shares on reddit | Metric | Total shares of a web message (from sources like News, Blogs, etc.) on Reddit |
Net vote | Metric | The sum of total upvotes minus total downvotes on a post. Net Vote = Total Upvotes – Total Downvotes |
Reddit supported engagement stats
Data Entity | Type | Description |
Earned Engagement | Metric | Reddit Score + Comment count (reddit score = net vote) |
Distinct Users | Metric | Distinct users count from which mentions are coming in. Note: It is required to have an Author URL in the message metadata for it to be considered as a distinct user. |
Comments Count | Metric | The total count of comments on a mention. |
To fetch data from Reddit
In order to fetch data from Reddit, you need to create a Topic with Reddit as its source. To learn in detail about how to create a topic, refer to Create a Topic in Topic Creation UI.
While creating or editing your topic, click the Apply Filters tab, and select Reddit as one of the Web Sources.
You can also select Source: Reddit from the dashboard/widget filter.
Some interesting operator
parentMessage – parentMessage: (Sachin OR Ronaldo OR Cricket OR Football) – this will fetch mentions whose parent message contains text keywords Sachin or Ronaldo or Cricket or Football.
To backfill data from Reddit
Once the topic is created/ updated, you can backfill the topic going far back to Quarter 2 of 2017.
To know more about how to backfill data, refer to Listening Backfill for Topics.
Limitations
Sender Profile link cannot be accessed due to API limitations
Sender Image URL does not work hence age, gender enrichment model does not work for reddit profiles, only works on the username which may not be accurate.
No support for differentiation of a "parent" comment v/s a "sub" or "threaded" comment/reply.
No location data is provided by API.