Explanations
contractions and possessives
Negative Logits
“     -1.75
’     -1.70
‘     -1.67
”     -1.54
’,    -1.50
.’    -1.45
’.    -1.40
.”    -1.37
,’    -1.36
“     -1.34
Positive Logits
^(@)      1.40
Efq       1.39
-"        1.39
。"       1.37
Jefus     1.34
...'      1.26
Majefty   1.24
poffible  1.24
itſelf    1.23
pleaf     1.23
Activations Density: 1.233%