ffm-baseline
Commit bc296a69, authored Nov 19, 2018 by 王志伟
repair bug
parent 56bc651a
Showing 1 changed file with 15 additions and 4 deletions
eda/feededa/src/main/scala/com/gmei/app_list.scala (+15, -4)
 package com.gmei

 import java.io.Serializable
+import org.apache.spark.sql.functions.udf
 import com.gmei.WeafareStat.{defaultParams, parser}
 import org.apache.spark.sql.{SaveMode, TiContext}
 ...
@@ -61,6 +62,15 @@ object app_list {
     val partition_date = param.date.replace("-", "")
     println(partition_date)
+    // Custom UDF used to add a column to the DataFrame
+    val code = (arg: String) => {
+      if (arg.getClass.getName == "java.lang.String") partition_date else 0
+    }
+    val addCol = udf(code)
+    // End of the UDF definition
     // Get the device_id of users hit by the strategy
     val app_list = sc.sql(
       s"""
 ...
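Review note on the `code` lambda in the hunk above: its two branches return a `String` and an `Int`, so Scala infers the result type `Any`, and Spark is likely to reject a UDF whose return type is `Any` because it cannot derive a column schema for it. The `getClass` check is also always true for a non-null `String` argument and throws on `null`. The following is a minimal plain-Scala sketch of the typing issue only; the object name, `partitionDate`, and the sample date are hypothetical stand-ins for `param.date`.

```scala
object UdfReturnType {
  // Hypothetical stand-in for param.date.replace("-", "") in the commit
  val partitionDate: String = "2018-11-19".replace("-", "")

  // Same shape as the lambda in the hunk: String in one branch, Int in the other,
  // so the only common result type is Any.
  val untyped: String => Any =
    (arg: String) => if (arg.getClass.getName == "java.lang.String") partitionDate else 0

  // Typed alternative (an assumption about the intent): always return the date string.
  val typed: String => String = (_: String) => partitionDate

  def main(args: Array[String]): Unit = {
    println(typed("some-device-id")) // prints 20181119
  }
}
```

If the intent is simply to stamp every row with the partition date, the typed version has a concrete `String` result type that Spark can map to a string column.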
@@ -72,10 +82,10 @@ object app_list {
       )
     //app_list.show()
     import sc.implicits._
-    val rdd = app_list.rdd.map(x => (x(0).toString, x(1).toString))
-      .filter(x => x._2.contains("新氧美容")).map(x => x._1).collect().toList.toDF()
-    rdd.show()
-    rdd.createOrReplaceTempView("device_id")
+    val rdd_df = app_list.rdd.map(x => (x(0).toString, x(1).toString))
+      .filter(x => x._2.contains("新氧美容")).map(x => x._1).collect().toList.toDF("device_id")
+    rdd_df.show()
+    rdd_df.createOrReplaceTempView("device_id")
     val temp = sc.sql(
       s"""
 ...
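Review note on the `rdd_df` change above: naming the column via `toDF("device_id")` lets the later SQL select `device_id`, but the pipeline still collects every matching id to the driver before turning the list back into a DataFrame. A possible alternative is to stay in the DataFrame API, sketched below. It assumes `sc` behaves like a `SparkSession`; the column names `device_id` and `app_list`, the helper `deviceIdsWithApp`, and the sample rows are hypothetical, since the original query is elided.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.col

object DeviceIdFilter {
  // Keeps rows whose app-list column contains appName and returns only device_id.
  // Rows with a null app list are dropped by the filter instead of throwing.
  def deviceIdsWithApp(apps: DataFrame, appName: String): DataFrame =
    apps.filter(col("app_list").contains(appName)).select("device_id")

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("device-id-filter")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample data standing in for the elided SQL result
    val apps = Seq(("d1", "新氧美容,other"), ("d2", "other"))
      .toDF("device_id", "app_list")

    val ids = deviceIdsWithApp(apps, "新氧美容")
    ids.createOrReplaceTempView("device_id")
    ids.show()

    spark.stop()
  }
}
```

This avoids `collect()` entirely, so the filtered ids never have to fit in driver memory.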
@@ -83,6 +93,7 @@ object app_list {
          |from device_id
        """.stripMargin
     )
+    temp.withColumn("stat_date", addCol(temp("device_id")))
     temp.show()
 ...
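Review note on the added `withColumn` line: DataFrames are immutable, so `withColumn` returns a new DataFrame and leaves `temp` unchanged. Because the result is not bound to a name in the hunk above, the following `temp.show()` prints the table without `stat_date`. A minimal sketch of one way to fix this follows, using the built-in `lit` for a constant column instead of a UDF; the helper `withStatDate` and the sample values are hypothetical.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.lit

object StatDateColumn {
  // Returns a NEW DataFrame with a constant stat_date column; df itself is not modified.
  def withStatDate(df: DataFrame, partitionDate: String): DataFrame =
    df.withColumn("stat_date", lit(partitionDate))

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("stat-date-column")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample standing in for the `temp` DataFrame in the commit
    val temp = Seq("d1", "d2").toDF("device_id")

    // Bind the result; calling show() on `temp` would still omit stat_date.
    val withDate = withStatDate(temp, "20181119")
    withDate.show()

    spark.stop()
  }
}
```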