INDEX
Explanations
words related to postal mail and fraud
references to mail-related activities or fraud
New Auto-Interp
Negative Logits
nen
-0.68
oulos
-0.64
LORD
-0.64
whiff
-0.62
ETF
-0.62
Durham
-0.61
iets
-0.61
onomous
-0.61
aughs
-0.60
razil
-0.60
POSITIVE LOGITS
boxes
1.39
bag
1.19
1.16
bags
1.05
mailbox
1.02
box
0.99
0.95
0.87
0.86
inbox
0.85
Activations Density 0.012%