INDEX
Explanations
questions or inquiries regarding reasons and explanations
"why" followed by a pronoun
New Auto-Interp
Negative Logits
Them
-0.58
Cæsar
-0.58
vielä
-0.56
alfo
-0.56
Makefile
-0.54
Pelop
-0.53
Makefile
-0.52
stdc
-0.52
šak
-0.50
Roskov
-0.49
POSITIVE LOGITS
we
1.50
they
1.47
there
1.35
the
1.17
it
1.12
you
1.11
he
1.10
someone
0.94
things
0.92
some
0.92
Activations Density 0.478%