Skip to content
Projects
Groups
Snippets
Help
Loading...
Sign in
Toggle navigation
F
ffm-baseline
Project
Project
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
ML
ffm-baseline
Commits
a5904974
Commit
a5904974
authored
Oct 09, 2018
by
高雅喆
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
addd city_id
parent
d1b33ab6
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
7 additions
and
6 deletions
+7
-6
Main.scala
eda/node2vec/src/main/scala/com/gmei/Main.scala
+7
-6
No files found.
eda/node2vec/src/main/scala/com/gmei/Main.scala
View file @
a5904974
...
...
@@ -190,24 +190,25 @@ object Main {
val
device_id
=
sc
.
sql
(
s
"""
|select a.device_id device_id,b.similarity_cid similarity_cid from
|(select device_id,first(cid) as cid from data_feed_click
|select a.device_id device_id,
a.city_id city_id ,
b.similarity_cid similarity_cid from
|(select device_id,
city_id,
first(cid) as cid from data_feed_click
|where cid in (select cid from nd_cid_similarity_matrix)
|group by device_id) a left join
|group by device_id
order by time
) a left join
|nd_cid_similarity_matrix b
|on a.cid = b.cid
|where b.similarity_cid is not null
"""
.
stripMargin
)
device_id
.
na
.
fill
(
Map
(
"city_id"
->
"beijing"
))
device_id
.
show
()
val
device_queue
=
device_id
.
rdd
.
map
{
item
=>
val
parts
=
(
item
.
getAs
[
String
](
fieldName
=
"device_id"
),
item
.
getAs
[
String
](
fieldName
=
"similarity_cid"
))
val
parts
=
(
item
.
getAs
[
String
](
fieldName
=
"device_id"
),
item
.
getAs
[
String
](
fieldName
=
"
city_id"
),
item
.
getAs
[
String
](
fieldName
=
"
similarity_cid"
))
Try
{
(
parts
.
_1
,
Try
(
parts
.
_2
.
toString
.
replace
(
"diary|"
,
""
)).
getOrElse
(
null
))
(
parts
.
_1
,
Try
(
parts
.
_2
.
toString
.
replace
(
"worldwide"
,
"beijing"
)),
Try
(
parts
.
_3
.
toString
.
replace
(
"diary|"
,
""
)).
getOrElse
(
null
))
}.
getOrElse
(
null
)
}.
filter
(
_
!=
null
).
toDF
(
"device_id"
,
"similarity_cid"
)
}.
filter
(
_
!=
null
).
toDF
(
"device_id"
,
"
city_id"
,
"
similarity_cid"
)
device_queue
.
take
(
20
).
foreach
(
println
)
GmeiConfig
.
writeToJDBCTable
(
device_queue
,
table
=
"nd_device_cid_similarity_matrix"
,
SaveMode
.
Overwrite
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment