ML / ffm-baseline
Commit e6eb6830, authored Apr 16, 2019 by 王志伟
Merge branch 'master' of http://git.wanmeizhensuo.com/ML/ffm-baseline
Parents: 9e356afa, 3a44f1b9
Showing 1 changed file with 6 additions and 3 deletions
tensnsorflow/es/feature.py (+6 -3)
@@ -69,8 +69,8 @@ def get_data():
     hospital = con_sql(db, sql)
     hospital = hospital.rename(columns={0: "service_id", 1: "hospital_id"})
     # print(hospital.head())
-    print("hospital")
-    print(hospital.count())
+    # print("hospital")
+    # print(hospital.count())
     hospital["service_id"] = hospital["service_id"].astype("str")
     df = pd.merge(df, hospital, on='service_id', how='left')
     df = df.drop("service_id", axis=1)
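The context lines of this hunk cast `service_id` to a string before the left join. A minimal sketch of that rename, cast, and merge sequence follows, assuming the left frame already stores `service_id` as strings. The sample values and the `device_type` column contents are made up for illustration and do not come from the repository's data.

```python
import pandas as pd

# Hypothetical stand-ins for the frames used in get_data(); values are invented.
df = pd.DataFrame({"service_id": ["1", "2", "3"], "device_type": ["a", "b", "c"]})
hospital = pd.DataFrame({0: [1, 2], 1: [10, 20]})  # integer column labels, as a raw query result might have

# Same steps as the diff: name the columns, align the key dtype, then left-join.
hospital = hospital.rename(columns={0: "service_id", 1: "hospital_id"})
hospital["service_id"] = hospital["service_id"].astype("str")

merged = pd.merge(df, hospital, on="service_id", how="left")
merged = merged.drop("service_id", axis=1)

# Every row of df survives a left join; unmatched keys get a missing hospital_id.
print(merged)
```

With a left join, the row whose `service_id` has no match in `hospital` is kept and its `hospital_id` is left missing, so the sample count does not change at this step.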
@@ -80,7 +80,10 @@ def get_data():
     print(df.shape)
     df = df.drop_duplicates(["ucity_id", "clevel2_id", "ccity_name", "device_type", "manufacturer",
-                             "channel", "top", "time", "stat_date", "app_list", "hospital_id", "level3_ids"])
+                             "channel", "top", "time", "stat_date", "app_list"])
+    # df = df.drop_duplicates(["ucity_id", "clevel2_id", "ccity_name", "device_type", "manufacturer",
+    #                          "channel", "top", "time", "stat_date", "app_list", "hospital_id", "level3_ids"])
     print("Number of samples after deduplication:", df.shape)
     app_list_number, app_list_map = multi_hot(df, "app_list", 2)
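This hunk removes `hospital_id` and `level3_ids` from the deduplication key. The sketch below illustrates what narrowing the `subset` passed to `drop_duplicates` can do: rows that differ only in a dropped key column now collapse into one. The frame is a made-up three-row example with a reduced set of columns, not the project's data.

```python
import pandas as pd

# Hypothetical sample: rows 0 and 1 differ only in hospital_id.
df = pd.DataFrame({
    "ucity_id": ["c1", "c1", "c2"],
    "app_list": ["x,y", "x,y", "x,y"],
    "hospital_id": ["h1", "h2", "h1"],
})

old_keys = ["ucity_id", "app_list", "hospital_id"]  # key before the change (abbreviated)
new_keys = ["ucity_id", "app_list"]                 # key after the change (abbreviated)

kept_old = df.drop_duplicates(subset=old_keys)
kept_new = df.drop_duplicates(subset=new_keys)

# With the narrower key, row 1 counts as a duplicate of row 0 and is dropped.
print(len(kept_old), len(kept_new))
```

Because `drop_duplicates` keeps the first occurrence by default, the `hospital_id` value that survives for a collapsed group depends on row order, which may matter if `hospital_id` is still used as a feature downstream.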