INDEX
Explanations
actions and interactions between characters
Negative Logits
Token       Logit
purpoſe     -1.03
houſe       -0.97
greateſt    -0.96
ſtate       -0.95
beſt        -0.95
faſt        -0.95
myſelf      -0.94
Roskov      -0.93
pleaſure    -0.92
itſelf      -0.92
Positive Logits
Token              Logit
a                  0.59
[token missing]    0.56
Long               0.51
Kaieteur           0.50
B                  0.49
for                0.49
K                  0.49
to                 0.48
also               0.48
on                 0.48
Activations Density 0.147%