ML / ffm-baseline
Commit a483e2b0
Authored Nov 28, 2018 by 王志伟
New statistics requirement
parent 5780f800
Showing 1 changed file with 2 additions and 1 deletion
eda/feededa/src/main/scala/com/gmei/strategy_other.scala
...
@@ -9,6 +9,7 @@ import scopt.OptionParser
import com.gmei.lib.AbstractParams
//import org.apache.hadoop.hive.ql.exec.spark.session.SparkSession
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions._
object strategy_other {
...
@@ -259,7 +260,7 @@ object diary_exposure {
val final_cid_city = diary_id_temp.join(df_cid_city, Seq("diary_id"), "left_outer").na.drop()
final_cid_city.show()
final_cid_city.groupBy("name").count().show(30)
final_cid_city.groupBy("name").count().orderBy(desc("count"))
...
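The second hunk joins diary IDs to a city lookup, drops rows containing nulls, and counts rows per `name`. The sketch below reproduces that pattern on made-up data, assuming a local SparkSession. `CityCountSketch`, `countsByCity`, and the sample rows are hypothetical and not part of the commit. Note that `orderBy` is a lazy transformation, so the sorted counts only materialize when an action such as `show` or `collect` follows it.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.desc

// Hypothetical sketch; names and data are illustrative, not from the commit.
object CityCountSketch {

  // Joins diary IDs to a (diary_id, name) lookup, drops unmatched rows,
  // and returns (name, count) pairs ordered by count, largest first.
  def countsByCity(
      spark: SparkSession,
      diaryIds: Seq[Long],
      cityLookup: Seq[(Long, String)]
  ): Array[(String, Long)] = {
    import spark.implicits._

    val diaryIdTemp = diaryIds.toDF("diary_id")
    val dfCidCity = cityLookup.toDF("diary_id", "name")

    // left_outer keeps every diary ID; na.drop() then removes rows
    // where any column (here: name) is null.
    val finalCidCity = diaryIdTemp
      .join(dfCidCity, Seq("diary_id"), "left_outer")
      .na.drop()

    finalCidCity
      .groupBy("name")
      .count()
      .orderBy(desc("count"))
      .as[(String, Long)]
      .collect() // the action that triggers the computation
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("city-count-sketch")
      .getOrCreate()

    val result = countsByCity(
      spark,
      Seq(1L, 2L, 3L, 4L),
      Seq((1L, "Beijing"), (2L, "Beijing"), (3L, "Shanghai"))
    )
    result.foreach { case (name, n) => println(s"$name: $n") }

    spark.stop()
  }
}
```

In this sample, diary ID 4 has no city, so the left outer join produces a null `name` for it and `na.drop()` discards the row. When no other column contains nulls, that combination yields the same rows as an inner join.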