INDEX
Explanations
verbs indicating mental actions
verbs conveying knowledge, awareness, or emotional states
Negative Logits
anwhile        -0.69
Annotations    -0.68
agher          -0.67
java           -0.67
opting         -0.64
merce          -0.63
enegger        -0.63
quart          -0.62
withdrawing    -0.62
arta           -0.62
Positive Logits
itself         0.90
occupants      0.66
erers          0.66
erer           0.65
ickets         0.64
lessly         0.64
ibly           0.61
rive           0.60
lur            0.60
wer            0.58
Activations Density: 0.676%