ffm-baseline
Commit 9cd86509
authored Nov 06, 2019 by 高雅喆
update
parent 58bcd621
Showing 1 changed file with 6 additions and 4 deletions
eda/smart_rank/gyz_test.py
@@ -99,12 +99,14 @@ device_ids_lst_rdd = spark.sparkContext.parallelize(device_info)
result = device_ids_lst_rdd.repartition(100).map(lambda x: get_user_service_portrait(x, all_word_tags, all_tag_tag_type, all_3tag_2tag, all_tags_name, size=None, pay_time=pay_time))
print(result.take(10))
result1 = result.map(
    lambda x: (x[0], x[1], x[2])
    lambda x: (str(x[0]), str(x[1]), str(x[2]))
)
path = "hdfs:///strategy/esmm/"
spark.createDataFrame(result1).toDF("device", "search_words", "user_portrait").repartition(1).write.format("csv").save(path=path + "portrait/", mode="overwrite")
spark.createDataFrame(result1).toDF("device", "search_words", "user_portrait").coalesce(1).write.format('com.databricks.spark.csv').save("~/test_df.csv", header='true')
result.saveAsTextFile("~/test_df.csv")
# path = "hdfs:///strategy/esmm/"
# spark.createDataFrame(result1).toDF("device", "search_words", "user_portrait").repartition(1).write.format("csv").save(path=path + "portrait/", mode="overwrite")
# result.saveAsTextFile("~/test_df.csv")
# df = result.toDF()
# df.show()
# result.write.format('csv').save("~/test_df.csv")
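The `result1` change casts each of the three fields to `str` before the rows reach `spark.createDataFrame`. The commit message says only "update", so the motivation is an assumption here: Spark's CSV writer rejects list- or map-typed columns, and string fields avoid that. A minimal plain-Python sketch of the same per-row cast follows; the helper name `stringify_row` and the sample rows are hypothetical and not taken from the repository.

```python
def stringify_row(row):
    """Cast the first three fields of a row to str, like the lambda in the diff."""
    return (str(row[0]), str(row[1]), str(row[2]))


# Hypothetical rows of (device, search_words, user_portrait); not real data.
sample_rows = [
    ("device_a", ["word1", "word2"], {"tag1": 0.8}),
    ("device_b", None, []),
]

normalized = [stringify_row(r) for r in sample_rows]
for row in normalized:
    print(row)
```

Note that `str(None)` yields the literal text `None`, so a missing value would be written to the CSV as that word rather than as an empty field.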
...
...
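Both uncommented save calls in this hunk target `"~/test_df.csv"`. Python does not expand `~` inside a plain string, and Spark most likely hands the path to the Hadoop filesystem layer without shell-style expansion, so the output may end up under a directory literally named `~`. In addition, with Spark's default settings a write to an output path that already exists normally fails, so the second call (`saveAsTextFile`) would probably raise once the first has created the directory. The sketch below builds explicit, distinct output URIs with the standard library only; `local_output_uri` and the two directory names are hypothetical.

```python
import os
from pathlib import Path


def local_output_uri(path):
    """Expand a leading '~' and return an absolute file:// URI for the path."""
    expanded = Path(os.path.expanduser(path)).resolve()
    return expanded.as_uri()


# Hypothetical, distinct targets so the two writes do not collide.
csv_uri = local_output_uri("~/test_df_csv")
text_uri = local_output_uri("~/test_df_text")

print(csv_uri)
print(text_uri)
```

These strings could then be passed to `.save(...)` and `saveAsTextFile(...)` respectively. A `file://` URI is only meaningful when the driver and executors share that filesystem, for example in local mode.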