INDEX
Explanations
instances of structured data or coding related to programming and configurations
New Auto-Interp
Negative Logits
ба
-0.50
c
-0.49
u
-0.48
by
-0.47
-0.47
ha
-0.47
nak
-0.46
sk
-0.45
-0.44
ll
-0.44
POSITIVE LOGITS
itſelf
1.17
ſelves
1.12
ſelf
1.10
myſelf
1.03
pleaſure
1.02
purpoſe
1.00
ſmall
0.97
BibitemShut
0.97
Efq
0.96
脚注の使い方
0.96
Activations Density 0.003%