Questions tagged [reddit]

Use for questions involving the API for Reddit, a social news ranking website.

Reddit, pronounced /ˈrɛdɪt/ or "red-it", is a social news ranking website.

Users have the option to submit links to content on the Internet or submit "self" posts that contain original, user-submitted text.

Other users may then vote the posted links "up" or "down", with the most successful links gaining prominence by reaching the front page.

In addition, users can comment on the posted links and reply to other commenters, forming an online community. Reddit users (also referred to as redditors) may create their own topical sections, known as subreddits, where they can submit links and comment while appealing to a specific niche.

Reddit is open source, and the primary codebase is hosted on GitHub. An extensive API is also provided.
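
As a rough illustration of working with the API, the sketch below builds a subreddit listing URL and pulls post titles out of a "Listing" object. It makes no network request. The `.json` URL shape, the helper names `listing_url` and `titles_from_listing`, and the hand-written `SAMPLE_RESPONSE` are assumptions for illustration only, not captured API output.

```python
import json
from urllib.parse import urlencode


def listing_url(subreddit, sort="new", limit=25):
    """Build a subreddit listing URL (assumed public .json endpoint shape)."""
    query = urlencode({"limit": limit})
    return f"https://www.reddit.com/r/{subreddit}/{sort}.json?{query}"


def titles_from_listing(payload):
    """Return the title of every child in a decoded Listing object."""
    if payload.get("kind") != "Listing":
        raise ValueError("expected a Listing object")
    return [child["data"]["title"] for child in payload["data"]["children"]]


# Hand-written sample in the Listing shape; illustrative, not real API output.
SAMPLE_RESPONSE = """
{
  "kind": "Listing",
  "data": {
    "children": [
      {"kind": "t3", "data": {"title": "First post", "score": 10}},
      {"kind": "t3", "data": {"title": "Second post", "score": 7}}
    ]
  }
}
"""

titles = titles_from_listing(json.loads(SAMPLE_RESPONSE))
```

A real client would also need to send the request, identify itself, and respect the rate limits that several of the questions below run into.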

1261 questions
10
votes
2 answers

Big query distinct on and group by

Following on from "Select first row in each GROUP BY group?", I am trying to do a very similar thing in Google BigQuery. Dataset: fh-bigquery:reddit_comments.2018_01 Aim: For each link_id (Reddit submission) select the first comment in terms of…
F.D
  • 767
  • 2
  • 10
  • 23
10
votes
3 answers

How can I tell Reddit to ignore my Logo for the Thumbnail?

When a user shares content from my website, the image appearing in the thumbnail is often the website's logo instead of the image itself. How can I tell Reddit to ignore the logo? I searched a lot and found answers on how to specify which…
Coder1000
  • 4,071
  • 9
  • 35
  • 84
10
votes
3 answers

urllib2 HTTP error 429

So I have a list of subreddits and I'm using urllib to open them. As I go through them, urllib eventually fails with: urllib2.HTTPError: HTTP Error 429: Unknown Doing some research I found that Reddit limits the amount of requests to their servers…
Florin Stingaciu
  • 8,085
  • 2
  • 24
  • 45
9
votes
1 answer

Retrieving comments from Reddit's API

So I've written some code that searches Reddit's API based on a query, and I want it to display comments as well. I have the following code nested inside my $.getJSON statement that pulls each title/post based on your search query, and now I want to…
mickdeez
  • 507
  • 3
  • 7
  • 16
9
votes
2 answers

Unable to refresh Reddit OAuth 2.0 access token

I cannot refresh the Reddit access token. When I send the following request to https://ssl.reddit.com/api/v1/access_token Content-Type: application/x-www-form-urlencoded Authorization:…
Peter Hudec
  • 2,462
  • 3
  • 22
  • 29
9
votes
1 answer

What is the Cassandra database schema used in Reddit?

Reddit is currently migrating its database from PostgreSQL to Apache Cassandra. Does anybody know what database schema Reddit uses in Cassandra?
Calin-Andrei Burloiu
  • 1,481
  • 2
  • 13
  • 25
8
votes
2 answers

Error 429 when invoking Reddit api from Google App Engine

I have been running a cron job on Google App Engine for over a month now without any issues. The job does a variety of things, one being that it uses urllib2 to make a call to retrieve a json response from Reddit as well as a few other sites. About…
dMcNavish
  • 627
  • 6
  • 12
8
votes
2 answers

Getting more than 100 search results with PRAW?

I'm using the following code to obtain reddit search results with PRAW 4.4.0: params = {'sort':'new', 'time_filter':'year'} return reddit.subreddit(subreddit).search('', **params) I'd like to scrape an indefinite amount of posts from the subreddit,…
Dreadnaught
  • 125
  • 2
  • 6
8
votes
1 answer

ValueError: jpeg is required unless explicitly disabled using --disable-jpeg, aborting

I am trying to install reddit on an Ubuntu 12.04 server, but I get an error. Please see the lines from the terminal below. Please help me fix it. I get this error after the following line: "sudo python setup.py develop" I already installed all the necessary…
7
votes
3 answers

How do I integrate a HTML code in a Python Script?

I have Python code that creates a report for a data frame from Reddit, converts it to simple HTML, and then emails it out. Below is the code: #Clean all the Dataframes test_clean = clean(test_test_df) brand_clean =…
Piyush Patil
  • 14,512
  • 6
  • 35
  • 54
7
votes
1 answer

How do I treat empty Strings as null objects with GSON?

I'm retrieving comments from the Reddit API. The model is threaded such that each Comment can internally have a List of Comments, named replies. Here's an example of how a JSON response would look: [ { "kind":"Listing", "data":{ …
Steve
  • 538
  • 4
  • 17
7
votes
1 answer

BigQuery - select top N posts from a large table for each subreddit

I am doing data mining on Reddit data in Google BigQuery, and I want the top 1000 posts ranked by score for each subreddit across the whole 201704 data. I have tried different techniques, but due to the limitations of BigQuery, the result is too large to…
Julian.Wu
  • 73
  • 1
  • 1
  • 4
7
votes
1 answer

ELM parse nested json

I have a JSON array with multiple comments, which can be nested. Example: [ { "author": "john", "comment" : ".....", "reply": "", }, { "author": "Paul", "comment" : ".....", "reply": [ { "author":…
BoumTAC
  • 3,531
  • 6
  • 32
  • 44
7
votes
3 answers

Is it possible to get a RSS feed of a reddit with links to posts with X upvotes?

I created a subreddit that I am connecting to a twitter account via twitterfeed. Currently I have it pointed to this RSS feed. I would like to filter this RSS feed with posts that have X upvotes such that only the good posts reach the Twitter…
Bryan Denny
  • 27,363
  • 32
  • 109
  • 125
7
votes
2 answers

PRAW: Replying to a post

As the title says, I'm unable to find the function in PRAW to reply to a post on Reddit. I've written a rather simple bot so far, which looks for new posts on a certain subreddit. My problem is that once finding the submission, I can't find how to…
agsl
  • 151
  • 2
  • 8