twint 安装及使用

Easter79
• 阅读 1621

分享这个post是自己方便查,还有中文网界对这个东西介绍太少。 更多的就看github项目twint吧。

Installation:

git+pip3:

git clone https://github.com/twintproject/twint.git
pip3 install -r requirements.txt
pip3 install twint

or pip3+pipenv:

pip3 install --user --upgrade -e git+https://github.com/twintproject/twint.git@origin/master#egg=twint
pipenv install -e git+https://github.com/twintproject/twint.git#egg=twint

You may meet module cannot found error when you try to run twint after installation. On ubuntu, add ~/.local/bin into your PATH by:

export PATH=$PATH:~/.local/bin

You may edit ~/.bashrc file to permanately add the '~/.local/bin' into your PATH.

Usage:

Running the twint cmd with arguments can give you results. A few simple examples to help you understand the basics:

  • twint -u username - Scrape all the Tweets from user's timeline.
  • twint -u username -s pineapple - Scrape all Tweets from the user's timeline containing pineapple.
  • twint -s pineapple - Collect every Tweet containing pineapple from everyone's Tweets.
  • twint -u username --year 2014 - Collect Tweets that were tweeted before 2014.
  • twint -u username --since "2015-12-20 20:30:15" - Collect Tweets that were tweeted since 2015-12-20 20:30:15.
  • twint -u username --since 2015-12-20 - Collect Tweets that were tweeted since 2015-12-20 00:00:00.
  • twint -u username -o file.txt - Scrape Tweets and save to file.txt.
  • twint -u username -o file.csv --csv - Scrape Tweets and save as a csv file.
  • twint -u username --email --phone - Show Tweets that might have phone numbers or email addresses.
  • twint -s "Donald Trump" --verified - Display Tweets by verified users that Tweeted about Donald Trump.
  • twint -g="48.880048,2.385939,1km" -o file.csv --csv - Scrape Tweets from a radius of 1km around a place in Paris and export them to a csv file.
  • twint -u username -es localhost:9200 - Output Tweets to Elastic search
  • twint -u username -o file.json --json - Scrape Tweets and save as a json file.
  • twint -u username --database tweets.db - Save Tweets to a SQLite database.
  • twint -u username --followers - Scrape a Twitter user's followers.
  • twint -u username --following - Scrape who a Twitter user follows.
  • twint -u username --favorites - Collect all the Tweets a user has favorited (gathers ~3200 tweet).
  • twint -u username --following --user-full - Collect full user information a person follows
  • twint -u username --profile-full - Use a slow, but effective method to gather Tweets from a user's profile (Gathers ~3200 Tweets, Including Retweets).
  • twint -u username --retweets - Use a quick method to gather the last 900 Tweets (that includes retweets) from a user's profile.
  • twint -u username --resume resume_file.txt - Resume a search starting from the last saved scroll-id.

More detail about the commands and options are located in the wiki

点赞
收藏
评论区
推荐文章
blmius blmius
2年前
MySQL:[Err] 1292 - Incorrect datetime value: ‘0000-00-00 00:00:00‘ for column ‘CREATE_TIME‘ at row 1
文章目录问题用navicat导入数据时,报错:原因这是因为当前的MySQL不支持datetime为0的情况。解决修改sql\mode:sql\mode:SQLMode定义了MySQL应支持的SQL语法、数据校验等,这样可以更容易地在不同的环境中使用MySQL。全局s
Easter79 Easter79
2年前
tcc分布式事务源码解析系列(二)之环境搭建
环境搭建教程在上一篇中,我们了解了项目的整体结构,以及每个模块大概的作用,现在我们来开始搭建整个环境。首先用户使用的JDK必须是1.8本地安装了git,maven,执行以下命令gitclonehttps://github.com/yu199195/happylifeplattcc.git
Stella981 Stella981
2年前
Apache Arrow入门
1.源码编译1.1环境准备安装cmake等工具sudoaptgetinstall\buildessential\cmake1.2下载源码gitclonehttps://github.com/apache/arrow.g
Wesley13 Wesley13
2年前
PHPCon2019
!(https://oscimg.oschina.net/oscnet/58c26436754f462a690d46969a35bf9938b.jpg)官方PHPCon2019PPT已上线 分享如下https://github.com/ThinkDevelopers/PHPConChina请勿私
Wesley13 Wesley13
2年前
Ubuntu虚拟机EOS安装教程
EOS安装一、源码下载注意事项:二、编译源码注意事项:三、安装1.install安装2.测试(可跳过)四、运行总结一、源码下载gitclonehttps://github.com
Wesley13 Wesley13
2年前
Java日期时间API系列30
  实际使用中,经常需要使用不同精确度的Date,比如保留到天2020042300:00:00,保留到小时,保留到分钟,保留到秒等,常见的方法是通过格式化到指定精确度(比如:yyyyMMdd),然后再解析为Date。Java8中可以用更多的方法来实现这个需求,下面使用三种方法:使用Format方法、 使用Of方法和使用With方法,性能对比,使用
Wesley13 Wesley13
2年前
ubuntu16.0.4安装freeswitch
1\.安装freeswitch库gitclonehttps://github.com/signalwire/freeswitch.git2\.安装环境依赖aptgetinstalllibeditdevlibldnsdevlibpcre3devlibspeexdspdevlibspeexdevl
Stella981 Stella981
2年前
Linux日志安全分析技巧
0x00前言我正在整理一个项目,收集和汇总了一些应急响应案例(不断更新中)。GitHub地址:https://github.com/Bypass007/EmergencyResponseNotes本文主要介绍Linux日志分析的技巧,更多详细信息请访问Github地址,欢迎Star。0x01日志简介Lin
Stella981 Stella981
2年前
Git7:使用Gitlab管理远程仓库
\toc\1、安装gitlab的源码安装极其复杂,如果要尝试源码安装可以参考如下文档:https://github.com/gitlabhq/gitlabhq/blob/master/doc/install/installation.md(https://www.oschina.net/action/GoToLink?urlht
Python进阶者 Python进阶者
3个月前
Excel中这日期老是出来00:00:00,怎么用Pandas把这个去除
大家好,我是皮皮。一、前言前几天在Python白银交流群【上海新年人】问了一个Pandas数据筛选的问题。问题如下:这日期老是出来00:00:00,怎么把这个去除。二、实现过程后来【论草莓如何成为冻干莓】给了一个思路和代码如下:pd.toexcel之前把这