Commit 37a31760 authored by 王志伟's avatar 王志伟

统计多天重复日记数量,增加过滤个人黑名单

parent c3caf28b
......@@ -321,6 +321,7 @@ object Repeated_content_recommendation_moreday {
|where cid_type = 'diary'
|and stat_date >='${date}'
|group by device_id,stat_date
|and device_id not in (select device_id from blacklist)
""".stripMargin
).rdd.map(row=>(row(0).toString,row(1).toString,row(2).toString)).map(row=>(row._2,row._3)).groupByKey()
.filter(x => x._2.size >1)
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment