25 lines
707 B
Markdown
25 lines
707 B
Markdown
# CSDN 爬虫
|
|
|
|
> 主要功能:爬取 csdn 博客指定用户的所有博文并转换为 markdown 格式保存到本地。
|
|
|
|
## 下载脚本
|
|
```
|
|
git clone https://github.com/ds19991999/csdn-spider.git
|
|
cd csdn-spider
|
|
python3 -m pip install -r requirements.txt
|
|
```
|
|
|
|
## 爬取用户全部博文
|
|
```python
|
|
import csdn
|
|
csdn.spider(["ds19991999", "u013088062"], 5)
|
|
# 参数 usernames: list, thread_num: int = 10, folder_name: str = "articles"
|
|
```
|
|
|
|
## LICENSE
|
|
|
|
<a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png" /></a>
|
|
|
|
|
|
|
|
`PS`:随意写的爬虫脚本,佛系更新。 |