Explanations
tokens in mathematical or scientific discourse, like "measure", "using", numbers, and symbols like "g"
scientific terminology
Negative Logits
Token         Logit
myſelf        -1.11
himſelf       -1.09
themſelves    -1.01
fubject       -1.00
pleaſure      -0.98
itſelf        -0.98
Monfieur      -0.96
purpoſe       -0.96
deſt          -0.94
Majefty       -0.93
Positive Logits
Token         Logit
Gra           0.53
work          0.52
}",           0.50
              0.48
ra            0.48
de            0.48
Sav           0.48
Sha           0.47
the           0.47
re            0.47
Activations Density 2.453%