INDEX
Explanations
expressions of agreement
instances of agreement or consensus expressed in the text
Negative Logits
akin          -0.78
chin          -0.71
Dise          -0.70
quer          -0.68
ãĤ¼ãĤ¦ãĤ¹        -0.67
ornia         -0.65
netted        -0.65
Scope         -0.62
phy           -0.62
ponds         -0.62
Positive Logits
regards       1.10
regard        1.01
respect       0.84
standing      0.78
ĪĴ            0.73
hus           0.71
asper         0.70
stood         0.67
unanimous     0.64
them          0.64
Activations Density 0.061%
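
Two entries in the logit lists, `ãĤ¼ãĤ¦ãĤ¹` and `ĪĴ`, look like tokens printed in a byte-level BPE vocabulary, where each raw byte is shown as a stand-in printable character. The sketch below assumes the GPT-2 style byte-to-character table (the page itself does not say which tokenizer produced these tokens) and maps such a string back to its raw bytes. The names `byte_to_char_table`, `token_bytes`, and `token_text` are illustrative, not part of any library.

```python
def byte_to_char_table():
    """Build the byte -> printable-character table used by GPT-2 style
    byte-level BPE vocabularies (assumed here, not confirmed by the page)."""
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(0x21, 0x7F))     # '!' .. '~'
        + list(range(0xA1, 0xAD))   # '¡' .. '¬'
        + list(range(0xAE, 0x100))  # '®' .. 'ÿ'
    )
    table = {b: chr(b) for b in printable}
    # Every other byte is shifted to code points from 256 upward, in byte order.
    offset = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + offset)
            offset += 1
    return table


# Inverse table: displayed character -> raw byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_to_char_table().items()}


def token_bytes(token):
    """Return the raw bytes behind a vocabulary string.
    Raises KeyError if a character is not in the assumed table."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token)


def token_text(token):
    """Decode the raw bytes as UTF-8; bytes that do not form a complete
    character are shown as U+FFFD rather than raising an error."""
    return token_bytes(token).decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĤ¼ãĤ¦ãĤ¹", "ĪĴ", "regards"]:
        print(repr(tok), token_bytes(tok), repr(token_text(tok)))
```

Under this assumption, `ãĤ¼ãĤ¦ãĤ¹` corresponds to the bytes `E3 82 BC E3 82 A6 E3 82 B9`, which is the UTF-8 encoding of the katakana string ゼウス. `ĪĴ` corresponds to the two bytes `88 92`, which are UTF-8 continuation bytes only, so that token is a fragment of a longer character and has no standalone text form. Plain ASCII tokens such as `regards` pass through unchanged.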