INDEX
Explanations
phrases expressing agreement with statements
phrases indicating agreement
New Auto-Interp
Negative Logits
akin
-0.74
Dise
-0.71
gins
-0.68
chin
-0.67
liner
-0.66
liners
-0.64
quer
-0.63
mite
-0.63
numbered
-0.62
bodied
-0.61
POSITIVE LOGITS
regards
0.98
regard
0.93
¬¼
0.75
asper
0.71
ibel
0.70
sentiments
0.69
ĪĴ
0.69
ogun
0.68
abus
0.67
hus
0.66
Activations Density 0.056%