I’m trying to pull data from Twitter over a month or so for a project. There are <10000 tweets over this time period with this hashtag, but I’m only seeming to get all the tweets from the current day. I got 68 yesterday, and 80 today; both were timestamped with the current day.
api = tweepy.API(auth)
igsjc_tweets = api.search(q="#igsjc", since='2014-12-31', count=100000)
ipdb> len(igsjc_tweets)
80
I know for certain there should be more than 80 tweets. I’ve heard that Twitter rate-limits to 1500 tweets at a time, but does it also rate-limit to a certain day? Note that I’ve also tried the Cursor
approach with
igsjc_tweets = tweepy.Cursor(api.search, q="#igsjc", since='2015-12-31', count=10000)
This also only gets me 80 tweets. Any tips or suggestions on how to get the full data would be appreciated.
2
Answers
Here’s the official tweepy tutorial on
Cursor
. Note: you need to iterate through theCursor
, shown below. Also, there is a max count that you can pass.items()
, so it’s probably a good idea to pull month-by-month or something similar and probably a good idea tosleep
in between calls. HTH!First, tweepy cannot bring too old data using its search API
I don’t know the exact limitation but maybe month or two back only.
anyway,
you can use this piece of code to get tweets.
i run it in order to get tweets from last few days and it works for me.
notice that you can refine it and add geocode information – i left an example commented out for you
Good luck