Project: ML / ffm-baseline
Commit 1b5adb17
Authored Oct 14, 2019 by 张彦钊
Run the data by channel
Parent: 63ef9d59
Showing 1 changed file with 30 additions and 30 deletions

local/meigou.py (+30, -30)
@@ -280,36 +280,36 @@ if __name__ == '__main__':
         .set("spark.driver.maxResultSize", "8g")
         .set("spark.sql.avro.compression.codec", "snappy")
     spark = SparkSession.builder.config(conf=sparkConf).enableHiveSupport().getOrCreate()
-    # for os in ["ios","android"]:
-    #     all_list = []
-    #     for i in range(1,3):
-    #         date_str = (datetime.date.today() - datetime.timedelta(days=i)).strftime("%Y%m%d")
-    #         tmp_list = [date_str]
-    #         tmp_list.extend(os_all_click(i,os))
-    #         tmp_list.extend(os_cpc_click(i,os))
-    #         all_list.append(tmp_list)
-    #     df = pd.DataFrame(all_list)
-    #     df = df.rename(columns={0: "date",1: "search", 2: "xiangguan",3:"home",4:"service_home",
-    #                             5: "all_clcik",
-    #                             6: "cpc_search", 7: "cpc_xiangguan",8:"cpc_home",9:"cpc_service_home",
-    #                             10:"cpc_all"})
-    #     df.to_csv('/home/gmuser/cpc_{}.csv'.format(os), index=False)
-    all_list = []
-    for i in range(1, 4):
-        date_str = (datetime.date.today() - datetime.timedelta(days=i)).strftime("%Y%m%d")
-        tmp_list = [date_str]
-        tmp_list.extend(all_click(i))
-        tmp_list.extend(cpc_click(i))
-        all_list.append(tmp_list)
-    df = pd.DataFrame(all_list)
-    df = df.rename(columns={0: "date",1: "search", 2: "xiangguan",3:"home",4:"service_home",
-                            5: "all_clcik",
-                            6: "cpc_search", 7: "cpc_xiangguan",8:"cpc_home",9:"cpc_service_home",
-                            10:"cpc_all"})
-    df.to_csv('/home/gmuser/cpc_1011.csv', index=False)
+    for os in ["ios","android"]:
+        all_list = []
+        for i in range(1,21):
+            date_str = (datetime.date.today() - datetime.timedelta(days=i)).strftime("%Y%m%d")
+            tmp_list = [date_str]
+            tmp_list.extend(os_all_click(i,os))
+            tmp_list.extend(os_cpc_click(i,os))
+            all_list.append(tmp_list)
+        df = pd.DataFrame(all_list)
+        df = df.rename(columns={0: "date",1: "search", 2: "xiangguan",3:"home",4:"service_home",
+                                5: "all_clcik",
+                                6: "cpc_search", 7: "cpc_xiangguan",8:"cpc_home",9:"cpc_service_home",
+                                10:"cpc_all"})
+        df.to_csv('/home/gmuser/cpc_{}.csv'.format(os), index=False)
+    # all_list = []
+    # for i in range(1, 4):
+    #     date_str = (datetime.date.today() - datetime.timedelta(days=i)).strftime("%Y%m%d")
+    #     tmp_list = [date_str]
+    #     tmp_list.extend(all_click(i))
+    #     tmp_list.extend(cpc_click(i))
+    #     all_list.append(tmp_list)
+    #
+    # df = pd.DataFrame(all_list)
+    #
+    # df = df.rename(columns={0: "date",1: "search", 2: "xiangguan",3:"home",4:"service_home",
+    #                         5: "all_clcik",
+    #                         6: "cpc_search", 7: "cpc_xiangguan",8:"cpc_home",9:"cpc_service_home",
+    #                         10:"cpc_all"})
+    # df.to_csv('/home/gmuser/cpc_1011.csv', index=False)
     spark.stop()
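
The overall run and the per-OS run in meigou.py differ only in the click functions they call, the day range, and the output path. The following is a minimal sketch, not the author's code, of how the shared loop could be factored into one helper so that switching between the two runs needs no commenting out. `build_rows`, `fake_all`, and `fake_cpc` are hypothetical names, and the assumption that each click function returns five counts is inferred from the eleven column names, not confirmed by the source.

```python
import datetime

# Column order taken from the rename() call in meigou.py
# (the "all_clcik" spelling is kept so existing CSV consumers still match).
COLUMNS = ["date", "search", "xiangguan", "home", "service_home", "all_clcik",
           "cpc_search", "cpc_xiangguan", "cpc_home", "cpc_service_home", "cpc_all"]


def build_rows(days, all_fn, cpc_fn, today=None):
    """Collect one row per day for the `days` days before `today`.

    all_fn(i) and cpc_fn(i) are assumed to return five counts each,
    mirroring how all_click(i) and cpc_click(i) are used in the script.
    """
    today = today or datetime.date.today()
    rows = []
    for i in range(1, days + 1):
        date_str = (today - datetime.timedelta(days=i)).strftime("%Y%m%d")
        values = [date_str, *all_fn(i), *cpc_fn(i)]
        if len(values) != len(COLUMNS):
            raise ValueError("expected %d values, got %d" % (len(COLUMNS), len(values)))
        rows.append(dict(zip(COLUMNS, values)))
    return rows


# Hypothetical stand-ins for the Spark-backed click functions.
def fake_all(i):
    return [i] * 5


def fake_cpc(i):
    return [i * 10] * 5


rows = build_rows(3, fake_all, fake_cpc, today=datetime.date(2019, 10, 14))
```

For the per-OS run, the same helper could be called as `build_rows(20, lambda i: os_all_click(i, platform), lambda i: os_cpc_click(i, platform))` for each platform. `pd.DataFrame(rows).to_csv(path, index=False)` would then write the CSV with the column names already in place, which removes the need for the separate `rename` step.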