INDEX
Explanations
references to personal accomplishments and participatory actions
New Auto-Interp
Negative Logits
Fifty
-0.20
Forty
-0.18
Hundred
-0.17
Thirty
-0.16
twentieth
-0.16
Thirty
-0.15
Sevent
-0.14
ovky
-0.14
Nin
-0.14
illions
-0.13
POSITIVE LOGITS
three
0.94
four
0.93
five
0.88
six
0.85
seven
0.79
two
0.78
eight
0.75
three
0.73
nine
0.67
five
0.66
Activations Density 2.826%