Explanations
sequences of underscores or whitespace characters
Variables ending in "perf"
variable assignments
Negative Logits
/      -0.79
–      -0.77
       -0.72
—      -0.69
-      -0.66
His    -0.65
A      -0.65
I      -0.64
Tr     -0.63
t      -0.62
Positive Logits
myſelf      1.39
itſelf      1.25
greateſt    1.20
pleaſure    1.19
Roskov      1.17
ſche        1.13
doubtnut    1.12
crdi        1.10
Monfieur    1.10
ſtate       1.09
Activations Density 0.683%