Skip to content
Projects
Groups
Snippets
Help
Loading...
Sign in
Toggle navigation
F
ffm-baseline
Project
Project
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
ML
ffm-baseline
Commits
7a7c45d2
Commit
7a7c45d2
authored
Dec 20, 2018
by
高雅喆
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
bug fix
parent
2a988367
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
7 additions
and
39 deletions
+7
-39
EsmmData.scala
eda/feededa/src/main/scala/com/gmei/EsmmData.scala
+7
-39
No files found.
eda/feededa/src/main/scala/com/gmei/EsmmData.scala
View file @
7a7c45d2
...
...
@@ -474,56 +474,24 @@ object GetPortrait {
val
stat_date
=
param
.
date
val
test1
=
sc
.
sql
(
s
"""
|select a.diary_id,a.tag_id,b.tag_type,b.level1_id,b.level2_id,b.level3_id
| from online.tl_hdfs_diary_tags_view a
| left join online.bl_tag_hierarchy_detail b
| on a.tag_id = b.id
| where a.partition_date = '20181218'
| and b.partition_date = '20181218'
"""
.
stripMargin
)
test1
.
show
()
val
test2
=
sc
.
sql
(
s
"""
|select c.diary_id,c.tag_type,
| concat_ws(',',collect_set(cast(c.level1_id as string))) as level1_ids,
| concat_ws(',',collect_set(cast(c.level2_id as string))) as level2_ids,
| concat_ws(',',collect_set(cast(c.level3_id as string))) as level3_ids from
| (select a.diary_id,a.tag_id,b.tag_type,b.level1_id,b.level2_id,b.level3_id
| from online.tl_hdfs_diary_tags_view a
| left join online.bl_tag_hierarchy_detail b
| on a.tag_id = b.id
| where a.partition_date = '20181218'
| and b.partition_date = '20181218') c
| group by c.diary_id,c.tag_type
"""
.
stripMargin
)
test2
.
show
()
val
diary_tag
=
sc
.
sql
(
s
"""
|select d.diary_id,
|concat_ws(',',collect_set(d.level1_ids)) as level1_ids,
|concat_ws(',',collect_set(d.level2_ids)) as level2_ids,
|concat_ws(',',collect_set(d.level3_ids)) as level3_ids from
| (select c.diary_id,c.tag_type,
|select c.diary_id,
| concat_ws(',',collect_set(cast(c.level1_id as string))) as level1_ids,
| concat_ws(',',collect_set(cast(c.level2_id as string))) as level2_ids,
| concat_ws(',',collect_set(cast(c.level3_id as string))) as level3_ids from
| (select a.diary_id,
a.tag_id,b.tag_type,
b.level1_id,b.level2_id,b.level3_id
| from
online.
tl_hdfs_diary_tags_view a
| left join
online.
bl_tag_hierarchy_detail b
| (select a.diary_id,b.level1_id,b.level2_id,b.level3_id
| from tl_hdfs_diary_tags_view a
| left join bl_tag_hierarchy_detail b
| on a.tag_id = b.id
| where a.partition_date = '${stat_date}'
| and b.partition_date = '${stat_date}') c
| group by c.diary_id,c.tag_type) d
|group by d.diary_id
| group by c.diary_id
"""
.
stripMargin
)
diary_tag
.
show
()
println
(
diary_tag
.
count
())
sc
.
stop
()
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment