Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
P
Python-100-Days
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
huangkq
Python-100-Days
Commits
e4204ed9
Commit
e4204ed9
authored
May 29, 2018
by
jackfrued
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
修复了文档中的bug
parent
48344a71
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
3 additions
and
3 deletions
+3
-3
01.网络爬虫和相关工具.md
Day66-75/01.网络爬虫和相关工具.md
+2
-2
02.数据采集和解析.md
Day66-75/02.数据采集和解析.md
+1
-1
No files found.
Day66-75/01.网络爬虫和相关工具.md
View file @
e4204ed9
...
...
@@ -93,11 +93,11 @@ Disallow: /
HTTP请求(请求行+请求头+空行+
[
消息体
]
):


HTTP响应(响应行+响应头+空行+消息体):


> 说明:但愿这两张如同泛黄的照片般的截图帮助你大概的了解到HTTP是一个怎样的协议。
...
...
Day66-75/02.数据采集和解析.md
View file @
e4204ed9
...
...
@@ -4,7 +4,7 @@
1.
下载数据 - urllib / requests / aiohttp。
2.
解析数据 - re / lxml / beautifulsoup4(bs4)/ pyquery。
3.
持久化 - pymysql / redis / sqlalchemy / pymongo。
3.
持久化 - pymysql / redis / sqlalchemy / p
eewee / p
ymongo。
4.
调度器 - 进程 / 线程 / 协程。
### HTML页面分析
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment