INDEX
Explanations
key concepts related to choice and decision-making in educational contexts
New Auto-Interp
Negative Logits
icina
-0.17
:normal
-0.16
veis
-0.15
him
-0.15
lÃł
-0.14
zbek
-0.14
á¿¶
-0.14
herself
-0.14
Ihnen
-0.14
chner
-0.14
POSITIVE LOGITS
that
0.39
they
0.32
we
0.31
that
0.29
mÃł
0.28
he
0.24
you
0.22
thats
0.22
everyone
0.21
someone
0.21
Activations Density 1.027%