INDEX
Explanations
phrases indicating a subjective evaluation or perception of a situation or idea
phrases that indicate perception or subjective judgment about various subjects
New Auto-Interp
Negative Logits
aughs
-0.71
mers
-0.68
urrence
-0.67
anwhile
-0.64
rowing
-0.63
cised
-0.62
isin
-0.60
ioch
-0.60
Strikes
-0.60
Hunts
-0.60
POSITIVE LOGITS
innocuous
1.05
daunting
1.00
intimidating
0.93
intuitive
0.92
confusing
0.90
insignificant
0.89
intuitive
0.87
contradictory
0.86
trivial
0.86
superf
0.85
Activations Density 0.064%