GitHub

您所在的位置:网站首页 推特图片爬虫 GitHub

GitHub

2024-07-14 07:41| 来源: 网络整理| 查看: 265

Twitter Scraper

GitHub GitHub contributors code size maintain status

🇰🇷 Read Korean Version

Twitter's API is annoying to work with, and has lots of limitations — luckily their frontend (JavaScript) has it's own API, which I reverse–engineered. No API rate limits. No restrictions. Extremely fast.

You can use this library to get the text of any user's Tweets trivially.

Prerequisites

Before you begin, ensure you have met the following requirements:

Internet Connection Python 3.6+ Installing twitter-scraper

If you want to use latest version, install from source. To install twitter-scraper from source, follow these steps:

Linux and macOS:

git clone https://github.com/bisguzar/twitter-scraper.git cd twitter-scraper sudo python3 setup.py install

Also, you can install with PyPI.

pip3 install twitter_scraper Using twitter_scraper

Just import twitter_scraper and call functions!

→ function get_tweets(query: str [, pages: int]) -> dictionary

You can get tweets of profile or parse tweets from hashtag, get_tweets takes username or hashtag on first parameter as string and how much pages you want to scan on second parameter as integer.

Keep in mind: First parameter need to start with #, number sign, if you want to get tweets from hashtag. pages parameter is optional. Python 3.7.3 (default, Mar 26 2019, 21:43:19) [GCC 8.2.1 20181127] on linux Type "help", "copyright", "credits" or "license" for more information. >>> from twitter_scraper import get_tweets >>> >>> for tweet in get_tweets('twitter', pages=1): ... print(tweet['text']) ... spooky vibe check …

It returns a dictionary for each tweet. Keys of the dictionary;

Key Type Description tweetId string Tweet's identifier, visit twitter.com/USERNAME/ID to view tweet. userId string Tweet's userId username string Tweet's username tweetUrl string Tweet's URL isRetweet boolean True if it is a retweet, False otherwise isPinned boolean True if it is a pinned tweet, False otherwise time datetime Published date of tweet text string Content of tweet replies integer Replies count of tweet retweets integer Retweet count of tweet likes integer Like count of tweet entries dictionary Has hashtags, videos, photos, urls keys. Each one's value is list → function get_trends() -> list

You can get the Trends of your area simply by calling get_trends(). It will return a list of strings.

Python 3.7.3 (default, Mar 26 2019, 21:43:19) [GCC 8.2.1 20181127] on linux Type "help", "copyright", "credits" or "license" for more information. >>> from twitter_scraper import get_trends >>> get_trends() ['#WHUTOT', '#ARSSOU', 'West Ham', '#AtalantaJuve', '#バビロニア', '#おっさんずラブinthasky', 'Southampton', 'Valverde', '#MMKGabAndMax', '#23NParoNacional'] → class Profile(username: str) -> class instance

You can get personal information of a profile, like birthday and biography if exists and public. This class takes username parameter. And returns itself. Access informations with class variables.

Python 3.7.3 (default, Mar 26 2019, 21:43:19) [GCC 8.2.1 20181127] on linux Type "help", "copyright", "credits" or "license" for more information. >>> from twitter_scraper import Profile >>> profile = Profile('bugraisguzar') >>> profile.location 'Istanbul' >>> profile.name 'Buğra İşgüzar' >>> profile.username 'bugraisguzar' → .to_dict() -> dict

to_dict is a method of Profile class. Returns profile datas as Python dictionary.

Python 3.7.3 (default, Mar 26 2019, 21:43:19) [GCC 8.2.1 20181127] on linux Type "help", "copyright", "credits" or "license" for more information. >>> from twitter_scraper import Profile >>> profile = Profile("bugraisguzar") >>> profile.to_dict() {'name': 'Buğra İşgüzar', 'username': 'bugraisguzar', 'birthday': None, 'biography': 'geliştirici@peptr', 'website': 'bisguzar.com', 'profile_photo': 'https://pbs.twimg.com/profile_images/1199305322474745861/nByxOcDZ_400x400.jpg', 'banner_photo': 'https://pbs.twimg.com/profile_banners/1019138658/1555346657/1500x500', 'likes_count': 2512, 'tweets_count': 756, 'followers_count': 483, 'following_count': 255, 'is_verified': False, 'is_private': False, user_id: "1019138658"} Contributing to twitter-scraper

To contribute to twitter-scraper, follow these steps:

Fork this repository. Create a branch with clear name: git checkout -b . Make your changes and commit them: git commit -m '' Push to the original branch: git push origin / Create the pull request.

Alternatively see the GitHub documentation on creating a pull request.

Contributors

Thanks to the following people who have contributed to this project:

@kennethreitz (author) @bisguzar (maintainer) @lionking6792 @ozanbayram @xeliot Contact

If you want to contact me you can reach me at @bugraisguzar.

License

This project uses the following license: MIT.



【本文地址】


今日新闻


推荐新闻


CopyRight 2018-2019 办公设备维修网 版权所有 豫ICP备15022753号-3