INDEX
Explanations
sentences containing comparisons or evaluations of likelihood
phrases expressing perceptions or judgments about situations
New Auto-Interp
Negative Logits
isner
-0.73
srfAttach
-0.67
ogh
-0.65
pour
-0.64
ioch
-0.64
redo
-0.64
chance
-0.63
hurst
-0.62
addin
-0.62
orously
-0.60
POSITIVE LOGITS
blush
0.78
innocuous
0.76
superf
0.74
daunting
0.69
confusing
0.69
confused
0.68
differently
0.68
acron
0.67
slightly
0.66
INGS
0.66
Activations Density 0.084%