filter reviews without features
quim-motger committed Dec 3, 2024
1 parent 7497257 commit 4a662ea
Showing 7 changed files with 2 additions and 49,168 deletions.
968 changes: 0 additions & 968 deletions data/Stage 1 - Feature extraction/output/com.discord.csv

Large diffs are not rendered by default.

4,487 changes: 0 additions & 4,487 deletions data/Stage 1 - Feature extraction/output/com.google.android.apps.bard.csv

876 changes: 0 additions & 876 deletions data/Stage 1 - Feature extraction/output/com.microsoft.copilot.csv

14,260 changes: 0 additions & 14,260 deletions data/Stage 1 - Feature extraction/output/com.openai.chatgpt.csv

25,653 changes: 0 additions & 25,653 deletions data/Stage 1 - Feature extraction/output/com.whatsapp.csv

2,923 changes: 0 additions & 2,923 deletions data/Stage 1 - Feature extraction/output/org.telegram.messenger.csv

@@ -45,7 +45,8 @@ def parse_and_add_column(csv_folder, json_folder, output_folder):

# Add the 'extracted_features_TransFeatEx' column
df['extracted_features_TransFeatEx'] = df['reviewId'].map(review_features_map)

# Filter out rows where extracted_features_TransFeatEx is empty
df = df[df['extracted_features_TransFeatEx'].notna() & (df['extracted_features_TransFeatEx'] != '')]
# Save the updated DataFrame to the output folder
output_path = os.path.join(output_folder, csv_file)
df.to_csv(output_path, sep=',', index=False)
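
The effect of the new filter can be checked on a toy DataFrame. The following is a minimal sketch, not the repository's code: the helper name drop_rows_without_features, the review IDs, and the feature strings are all hypothetical, and it assumes the mapped values are plain strings. A review with no entry in the map becomes NaN after .map() and is dropped, as is a review mapped to an empty string.

```python
import pandas as pd


def drop_rows_without_features(df, column="extracted_features_TransFeatEx"):
    """Keep only rows whose feature column is neither missing nor an empty string.

    Hypothetical helper that wraps the same boolean mask used in the diff.
    """
    has_value = df[column].notna() & (df[column] != "")
    return df[has_value]


# Hypothetical review IDs and feature strings, for illustration only.
reviews = pd.DataFrame({"reviewId": ["r1", "r2", "r3", "r4"]})
review_features_map = {"r1": "send message", "r2": "", "r4": "voice call"}

# "r3" has no entry in the map, so .map() yields NaN for that row.
reviews["extracted_features_TransFeatEx"] = reviews["reviewId"].map(review_features_map)

filtered = drop_rows_without_features(reviews)
print(filtered["reviewId"].tolist())  # → ['r1', 'r4']
```

Note that this mask only catches NaN and the exact empty string. A whitespace-only string or an empty list would pass through, so if the feature map can hold such values a stricter check may be needed.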