INDEX
Explanations
specific abbreviations or shorthand associated with technical terms and references
Abbreviations after initial letters
abbreviations followed by short codes
New Auto-Interp
Negative Logits
f
-0.63
-0.60
la
-0.60
z
-0.60
sa
-0.60
op
-0.59
pa
-0.59
er
-0.58
der
-0.58
y
-0.57
POSITIVE LOGITS
myſelf
1.25
ſelf
1.19
raiſ
1.18
Jefus
1.18
purpoſe
1.17
pleaſure
1.17
Anſ
1.16
ſever
1.14
houſe
1.13
ſmall
1.12
Activations Density 0.427%