INDEX
Explanations
references to actions and perceptions related to human experience
New Auto-Interp
Negative Logits
majánló
-1.15
ſelves
-1.11
Majefty
-1.05
<unused74>
-1.05
<unused28>
-1.04
<unused8>
-1.04
<unused41>
-1.04
<unused43>
-1.04
[@BOS@]
-1.04
<unused3>
-1.04
POSITIVE LOGITS
[
0.34
prijs
0.31
toepassing
0.31
gevallen
0.31
skall
0.31
genomen
0.30
[
0.29
trecho
0.29
numai
0.28
vaiz
0.27
Activations Density 0.017%