GetOldTweets3 is a Python library for accessing old tweets, which is not easily feasible with libraries like Tweepy. Recently, however, it has had an unresolved issue caused by the new Twitter API: https://github.com/Mottl/GetOldTweets3/issues/98.
The question is: what alternative library to GetOldTweets3 can retrieve tweets without time constraints? In my experience, Tweepy cannot retrieve more than 200 tweets.
3 Answers
The 200-tweet limit is a per-request maximum. You can retrieve successive "pages" of tweets by passing the returned next parameter in your following request to get the next page of up to 200. If you are using the Standard Search API, these requests will stop returning tweets older than about a week. With the Premium Search API's full-archive endpoint, you can get all tweets going back to 2006. It is explained in detail here: https://developer.twitter.com/en/docs/twitter-api/v1/tweets/search/api-reference
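To make the paging idea concrete, here is a minimal sketch of the loop, not a definitive implementation. The `fetch_page` callable is hypothetical: it stands in for whatever function sends one search request and returns the parsed JSON. The sketch assumes the response is a dict with a `results` list and an optional `next` token; `fake_fetch` only simulates two pages so the loop can be run without credentials.

```python
from typing import Any, Callable, Dict, Iterator, Optional

Page = Dict[str, Any]


def iter_tweets(
    fetch_page: Callable[[Optional[str]], Page],
    max_pages: int = 10,
) -> Iterator[dict]:
    """Yield tweets page by page, following the `next` token until it is absent."""
    token: Optional[str] = None
    for _ in range(max_pages):
        page = fetch_page(token)          # one request per page
        yield from page.get("results", [])
        token = page.get("next")          # assumed key name for the paging token
        if token is None:                 # no token means this was the last page
            break


def fake_fetch(token: Optional[str]) -> Page:
    """Hypothetical stand-in for a real API call; serves two canned pages."""
    pages = {
        None: {"results": [{"id": 1}, {"id": 2}], "next": "a"},
        "a": {"results": [{"id": 3}]},
    }
    return pages[token]


ids = [tweet["id"] for tweet in iter_tweets(fake_fetch)]
print(ids)  # [1, 2, 3]
```

In real use you would replace `fake_fetch` with a function that calls the search endpoint and passes the token along as the `next` request parameter. The `max_pages` cap is there so a paging loop cannot burn through your request quota unnoticed.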
So far, the only method of scraping tweets that still seems to work is snscrape’s jsonl method. Alternatively, there is this: https://github.com/rsafa/get-latest-tweets/
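For the snscrape route, a rough sketch of reading the JSON Lines output and collecting tweet IDs might look like this. It assumes the file was produced by something like `snscrape --jsonl twitter-search "your query" > tweets.jsonl` and that each line is a JSON object with an `id` field; the sample lines below are made up for illustration and are not real snscrape output.

```python
import json
from typing import Iterable, List


def tweet_ids_from_jsonl(lines: Iterable[str]) -> List[int]:
    """Collect the `id` field from each non-empty JSON Lines record."""
    ids: List[int] = []
    for line in lines:
        line = line.strip()
        if not line:                     # skip blank lines
            continue
        record = json.loads(line)        # one JSON object per line
        ids.append(int(record["id"]))    # assumed field name: "id"
    return ids


# Made-up sample records standing in for a real tweets.jsonl file.
sample = [
    '{"id": 101, "content": "first"}',
    "",
    '{"id": 102, "content": "second"}',
]
print(tweet_ids_from_jsonl(sample))  # [101, 102]
```

With a real file you would pass an open file object instead of `sample`, for example `with open("tweets.jsonl") as f: ids = tweet_ids_from_jsonl(f)`.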
I would recommend using snscrape. The IDs collected this way can then be passed to api.statuses_lookup. By using api.statuses_lookup you can download 300*100 tweets (300 requests of up to 100 IDs each) per 15 minutes via the Twitter API.
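The batching behind the 300*100 figure could be sketched as follows. This is only an illustration under stated assumptions: `lookup` is a hypothetical callable that takes a list of up to 100 IDs and returns the matching tweets (with Tweepy you would pass `api.statuses_lookup` here), and `fake_lookup` is a stand-in so the code runs without credentials.

```python
from typing import Callable, Iterator, List, Sequence

BATCH_SIZE = 100            # IDs per lookup request
REQUESTS_PER_WINDOW = 300   # requests per 15 minutes, as stated in the answer


def batches(ids: Sequence[int], size: int = BATCH_SIZE) -> Iterator[List[int]]:
    """Split the ID list into consecutive chunks of at most `size` items."""
    for start in range(0, len(ids), size):
        yield list(ids[start:start + size])


def lookup_all(ids: Sequence[int], lookup: Callable[[List[int]], list]) -> list:
    """Call `lookup` once per batch and concatenate the results."""
    tweets: list = []
    for batch in batches(ids):
        tweets.extend(lookup(batch))
    return tweets


def fake_lookup(batch: List[int]) -> list:
    """Hypothetical stand-in for api.statuses_lookup."""
    return [{"id": tweet_id} for tweet_id in batch]


ids = list(range(250))
sizes = [len(batch) for batch in batches(ids)]
print(sizes)                             # [100, 100, 50]
print(len(lookup_all(ids, fake_lookup)))  # 250
print(REQUESTS_PER_WINDOW * BATCH_SIZE)   # 30000 tweets per 15-minute window
```

If you have more than 30,000 IDs, you would need to pause between windows or let your client library wait on the rate limit for you.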