Skip to content
Projects
Groups
Snippets
Help
Loading...
Sign in
Toggle navigation
F
ffm-baseline
Project
Project
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
ML
ffm-baseline
Commits
7f122038
Commit
7f122038
authored
Dec 20, 2018
by
王志伟
Browse files
Options
Browse Files
Download
Plain Diff
Merge branch 'master' of
http://git.wanmeizhensuo.com/ML/ffm-baseline
parents
a2aff4c6
7a7c45d2
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
12 additions
and
13 deletions
+12
-13
EsmmData.scala
eda/feededa/src/main/scala/com/gmei/EsmmData.scala
+12
-13
No files found.
eda/feededa/src/main/scala/com/gmei/EsmmData.scala
View file @
7f122038
...
...
@@ -472,27 +472,26 @@ object GetPortrait {
val
ti
=
new
TiContext
(
sc
)
ti
.
tidbMapTable
(
dbName
=
"jerry_prod"
,
tableName
=
"data_feed_click"
)
val
stat_date
=
param
.
date
val
diary_tag
=
sc
.
sql
(
s
"""
|select d.diary_id,
|(case when d.tag_type = '1' then d.level1_ids else "" end) level1_ids,
|(case when d.tag_type = '2' then d.level2_ids else "" end) level2_ids,
|(case when d.tag_type = '3' then d.level3_ids else "" end) level3_ids from
| (select c.diary_id,c.tag_type,
| concat_ws(c.level1_id) as level1_ids
| concat_ws(c.level2_id) as level2_ids
| concat_ws(c.level3_id) as level3_ids from
| (select a.diary_id,a.tag_id,b.tag_type,b.level1_id,b.level2_id,b.level3_id
|select c.diary_id,
| concat_ws(',',collect_set(cast(c.level1_id as string))) as level1_ids,
| concat_ws(',',collect_set(cast(c.level2_id as string))) as level2_ids,
| concat_ws(',',collect_set(cast(c.level3_id as string))) as level3_ids from
| (select a.diary_id,b.level1_id,b.level2_id,b.level3_id
| from tl_hdfs_diary_tags_view a
| left join bl_tag_hierarchy_detail b
| on a.tag_id = b.id
| where a.partition_date = '20181218'
| and b.partition_date = '20181218') c
| group by c.diary_id,c.tag_type) d
|group by d.diary_id
| where a.partition_date = '${stat_date}'
| and b.partition_date = '${stat_date}') c
| group by c.diary_id
"""
.
stripMargin
)
diary_tag
.
show
()
println
(
diary_tag
.
count
())
sc
.
stop
()
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment