rank / gm_strategy_cvr
Commit 055f2a56, authored Jul 21, 2020 by 赵威
update field
parent 847e44b0
Showing 2 changed files with 9 additions and 8 deletions
src/main.py  +4 -4
src/models/esmm/fe.py  +5 -4
src/main.py
@@ -19,15 +19,15 @@ from models.esmm.model import esmm_model_fn, model_export, model_predict
 def main():
     time_begin = time.time()
     # df = get_device_df_from_redis()
-    # df2 = get_diary_df_from_redis()
-    # print(df2.sample(1))
+    df2 = get_diary_df_from_redis()
+    print(df2.sample(1))
     # print(df.size)
     # print(df2.size)
     # a = device_feature_engineering(df)
     # print(a.size)
-    # b = diary_feature_engineering(df2)
-    # print(b.sample(1))
+    b = diary_feature_engineering(df2, from_redis=True)
+    print(b.sample(1))
     device_df, diary_df, click_df, conversion_df = read_csv_data(Path("~/data/cvr_data/"))
     print(diary_df.sample(1))
src/models/esmm/fe.py
@@ -66,7 +66,7 @@ def device_feature_engineering(df):
     return device_df[device_columns]
-def diary_feature_engineering(df):
+def diary_feature_engineering(df, from_redis=False):
     diary_df = df.copy()
     str_bool_map = {"true": True, "false": False}
@@ -86,9 +86,10 @@ def diary_feature_engineering(df):
     diary_df["second_positions"] = diary_df["second_positions"].apply(lambda d: d if isinstance(d, list) else [])
     diary_df["projects"] = diary_df["projects"].apply(lambda d: d if isinstance(d, list) else [])
-    # diary_df["is_pure_author"] = diary_df["is_pure_author"].map(str_bool_map)
-    # diary_df["is_have_pure_reply"] = diary_df["is_have_pure_reply"].map(str_bool_map)
-    # diary_df["is_have_reply"] = diary_df["is_have_reply"].map(str_bool_map)
+    if from_redis:
+        diary_df["is_pure_author"] = diary_df["is_pure_author"].map(str_bool_map)
+        diary_df["is_have_pure_reply"] = diary_df["is_have_pure_reply"].map(str_bool_map)
+        diary_df["is_have_reply"] = diary_df["is_have_reply"].map(str_bool_map)
     diary_df["is_pure_author"] = diary_df["is_pure_author"].astype(int)
     diary_df["is_have_pure_reply"] = diary_df["is_have_pure_reply"].astype(int)
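The change to fe.py runs the string-to-boolean mapping only when `from_redis` is set. The commit message does not say why. One plausible reading, offered here as an assumption, is that flags read from Redis arrive as the strings "true"/"false", while the CSV data already holds booleans. Passing booleans through a string-keyed dict with `Series.map` yields NaN, which `astype(int)` cannot convert. The sketch below illustrates that behavior with pandas. `normalize_flags`, `FLAG_COLUMNS`, and the two sample frames are hypothetical names for illustration and are not code from the repository. It also casts all three flag columns to int, whereas the visible part of the diff shows the cast for only two of them.

```python
import pandas as pd

# Same mapping as str_bool_map in the diff.
STR_BOOL_MAP = {"true": True, "false": False}

# Hypothetical helper list: the three flag columns touched by the commit.
FLAG_COLUMNS = ["is_pure_author", "is_have_pure_reply", "is_have_reply"]


def normalize_flags(df, from_redis=False):
    """Return a copy of df with the flag columns cast to 0/1 integers.

    When from_redis is True, the columns are assumed to hold the strings
    "true"/"false" and are mapped to booleans before the cast.
    """
    out = df.copy()
    for col in FLAG_COLUMNS:
        if from_redis:
            out[col] = out[col].map(STR_BOOL_MAP)
        out[col] = out[col].astype(int)
    return out


# Assumed shape of Redis-sourced data: flags stored as strings.
redis_like = pd.DataFrame({
    "is_pure_author": ["true", "false"],
    "is_have_pure_reply": ["false", "false"],
    "is_have_reply": ["true", "true"],
})

# Assumed shape of CSV-sourced data: flags already boolean.
csv_like = pd.DataFrame({
    "is_pure_author": [True, False],
    "is_have_pure_reply": [False, False],
    "is_have_reply": [True, True],
})

redis_flags = normalize_flags(redis_like, from_redis=True)
csv_flags = normalize_flags(csv_like)

# Mapping booleans through the string-keyed dict finds no matching keys,
# so every value becomes NaN. This is what the flag guards against.
unguarded = csv_like["is_pure_author"].map(STR_BOOL_MAP)
```

Under these assumptions both inputs end up with the same integer columns, and `unguarded` shows what the unconditional mapping would do to data that is already boolean.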