INDEX
Explanations
texts related to spam and potential scams
references to spam and scams in the text
Negative Logits
hani          -1.10
Borders       -0.73
cider         -0.72
ederal        -0.69
IFE           -0.66
Dayton        -0.64
Judges        -0.63
Differences   -0.63
Difference    -0.63
Governors     -0.62
Positive Logits
ming          1.42
mer           1.23
mers          1.02
ulent         1.00
pering        1.00
crow          0.99
my            0.95
pling         0.91
vertising     0.89
bara          0.89
Activations Density 0.074%