INDEX
Explanations
articles and modifiers in various forms
New Auto-Interp
Negative Logits
itſelf
-0.84
Shakspeare
-0.78
myſelf
-0.75
་་
-0.74
AndEndTag
-0.73
Jefus
-0.73
JsonHelper
-0.72
IsMutable
-0.71
Majefty
-0.71
Cæsar
-0.70
POSITIVE LOGITS
level
0.71
court
0.56
parent
0.55
the
0.54
través
0.53
tingkat
0.52
full
0.52
escala
0.51
partir
0.51
be
0.50
Activations Density 0.020%