INDEX
Explanations
introductions or definitions
New Auto-Interp
Negative Logits
yada
1.35
etc
1.20
inoltre
1.20
なども
1.14
plz
1.11
др
1.10
bonus
1.09
ebenfalls
1.09
برضو
1.07
etc
1.07
POSITIVE LOGITS
The
1.42
The
1.26
:
1.23
What
1.22
By
1.15
For
1.15
And
1.13
What
1.12
?
1.10
–
1.09
Activations Density 0.594%