Explanations
text that contains coding or programming elements
Negative Logits
'              -0.66
UnusedPrivate  -0.65
‘              -0.57
&              -0.56
`              -0.53
(              -0.52
V              -0.52
/              -0.50
windowFixed    -0.49
v              -0.49
Positive Logits
houſe    1.10
ſelf     1.09
purpoſe  1.07
ſelves   1.06
Majefty  1.03
་་       1.01
myſelf   0.97
itſelf   0.97
Efq      0.96
Diſ      0.94
Activations Density: 0.029%