1,035
edits
No edit summary |
|||
Line 27: | Line 27: | ||
# sort values so that entries with reasons come first, dedupe | # sort values so that entries with reasons come first, dedupe | ||
dfs.sort_values(['domain','reason']).drop_duplicates(subset=['domain','severity'], keep='first') | # keep duplicate domains that different lists have at different severity levels for manual triage | ||
dfs = dfs.sort_values(['domain','reason']).drop_duplicates(subset=['domain','severity'], keep='first') | |||
# add columns and rename to match import format | # add columns and rename to match import format | ||
Line 49: | Line 48: | ||
That will give you a deduped CSV, but then you still need to go through manually and remove obfuscated domain names and check to make sure the blocks align with what would fit for the instance :) | That will give you a deduped CSV, but then you still need to go through manually and remove obfuscated domain names and check to make sure the blocks align with what would fit for the instance :) | ||
== social.coop == | == social.coop == |