INDEX
Explanations
words related to fraudulent activities or schemes
occurrences of the word "scam" and related terms
New Auto-Interp
Negative Logits
FK
-0.68
Atom
-0.68
IFE
-0.67
Borders
-0.63
itarian
-0.62
hani
-0.61
Olive
-0.61
Differences
-0.60
temperature
-0.57
éĹĺ
-0.57
POSITIVE LOGITS
ulent
1.31
ulence
1.11
pering
1.07
sters
1.05
crow
0.92
ulously
0.90
perpetrated
0.89
atown
0.89
scam
0.87
ster
0.87
Activations Density 0.049%