INDEX
Explanations
phrases and words indicating an evaluation or judgment, typically with a negative connotation
Negative Logits
prus        -0.73
tsky        -0.70
rote        -0.70
utm         -0.70
iHUD        -0.69
gypt        -0.68
rain        -0.67
release     -0.66
ortment     -0.65
thur        -0.65
Positive Logits
anymore      1.01
enough       0.96
nor          0.96
whatsoever   0.89
Enough       0.78
enough       0.76
anywhere     0.72
either       0.69
anybody      0.68
except       0.67
Activations Density 0.082%
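
The density figure above can be read as the share of tokens on which the feature fires. The page does not define it, so the following is only a minimal sketch under that assumption: the function name, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of tokens with a nonzero
    (here: above-threshold) activation. The source page does not confirm this.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 10,000 tokens, 8 of which activate the feature.
sample = [0.0] * 9992 + [1.5] * 8
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.082% would correspond to roughly 8 active tokens in every 10,000.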