INDEX
Explanations
notions of patience and understanding in social contexts
New Auto-Interp
Negative Logits
_ISO
-0.17
uky
-0.16
urst
-0.16
usher
-0.15
PartialView
-0.15
pom
-0.15
ikel
-0.15
verz
-0.14
egov
-0.14
ypse
-0.14
POSITIVE LOGITS
ney
0.16
unw
0.16
sey
0.14
dispers
0.14
tol
0.14
Silent
0.14
argent
0.14
Moore
0.14
ÅĤo
0.14
vert
0.14
Activations Density 0.240%