INDEX
Explanations
email-related phrases and terms
New Auto-Interp
Negative Logits
REL
-0.65
anova
-0.63
ingred
-0.62
ahoo
-0.60
AVG
-0.60
caps
-0.59
clicks
-0.57
uties
-0.57
wcs
-0.57
XL
-0.57
POSITIVE LOGITS
0.84
coli
0.84
vironment
0.82
mails
0.76
tainment
0.74
ionage
0.70
theless
0.68
cigarette
0.67
Reloaded
0.66
hardt
0.66
Activations Density 1.097%