INDEX
Explanations
short or descriptive words followed by nouns
New Auto-Interp
Negative Logits
Most
-0.54
most
-0.50
A
-0.48
المعيارى
-0.48
Most
-0.48
Lots
-0.46
et
-0.45
lotes
-0.45
,
-0.43
in
-0.43
POSITIVE LOGITS
Efq
1.34
pleaſure
1.17
Jefus
1.16
purpoſe
1.15
ArrowToggle
1.12
Monfieur
1.12
Majefty
1.09
Shakspeare
1.09
itſelf
1.08
Chriftian
1.06
Activations Density 1.540%