Commit 7a951ada, authored 4 years ago by litaolemo
update
Parent: f71eea5b
Branches: master, litao, mr/develop/xiaohongshu, soyang
No related merge requests found
Showing 1 changed file with 3 additions and 2 deletions
crawler_sys/site_crawler/crawler_zhihu.py @ 7a951ada
@@ -48,8 +48,7 @@ class Crawler_zhihu():
        self.video_data['platform'] = self.platform
        # remove fields that crawled data don't have
        pop_key_Lst = ['channel', 'describe', 'isOriginal', "repost_count", "video_id"]
        import pdb
        pdb.set_trace()
        try:
            with open('./zhihu.js', 'r', encoding='utf-8') as f:
                js = f.read()
...
@@ -168,6 +167,8 @@ class Crawler_zhihu():
            "d_c0": '"AIDu7_zGrA-PToWVy-siVNLS835i5YXmFCQ=|1562072925"',
            "KLBRSID": None
        }
        import pdb
        pdb.set_trace()
        cookies_dict.update(res_cookies_dict)
        url = "https://www.zhihu.com/api/v4/search_v3?t=general&q={0}&correction=1&offset=0&limit=20&lc_idx=0&show_all_topics=0".format(urllib.parse.quote(keyword))
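
Reviewer sketch (not part of this commit): the search URL in the hunk above is assembled by formatting a percent-encoded keyword into a fixed query string. The same URL could plausibly be built from a parameter dict with urllib.parse.urlencode, which keeps each parameter visible and editable. The names SEARCH_ENDPOINT and build_search_url are hypothetical and do not appear in crawler_zhihu.py; the parameter names and values are copied from the URL in the diff. One known difference: urlencode also encodes "/" in the keyword, whereas a bare quote() call leaves it unescaped.

```python
from urllib.parse import parse_qs, quote, urlencode, urlsplit

# Hypothetical constant; the endpoint itself is taken from the URL in the diff.
SEARCH_ENDPOINT = "https://www.zhihu.com/api/v4/search_v3"


def build_search_url(keyword, offset=0, limit=20):
    """Build the search URL from a parameter dict (hypothetical helper)."""
    params = {
        "t": "general",
        "q": keyword,
        "correction": 1,
        "offset": offset,
        "limit": limit,
        "lc_idx": 0,
        "show_all_topics": 0,
    }
    # quote_via=quote encodes spaces as %20, like urllib.parse.quote in the diff;
    # the default (quote_plus) would encode them as "+".
    return SEARCH_ENDPOINT + "?" + urlencode(params, quote_via=quote)


if __name__ == "__main__":
    print(build_search_url("python"))
```

For a plain ASCII keyword this yields the same string as the format() call in the diff; offset and limit are exposed as arguments only as an assumption about how paging might be driven.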