INDEX
Explanations
first-person expressions of personal opinions or preferences
expressions of personal opinions or subjective statements
New Auto-Interp
Negative Logits
Meadow
-0.84
Clintons
-0.67
TED
-0.67
Roses
-0.65
iens
-0.65
Freeman
-0.64
Gulf
-0.64
Butter
-0.63
shadows
-0.62
Lim
-0.61
POSITIVE LOGITS
Personally
1.05
etheless
1.04
Personally
1.04
odox
0.92
isSpecialOrderable
0.83
minded
0.80
cest
0.77
opin
0.76
ographically
0.75
wagen
0.74
Activations Density 0.006%