INDEX
Explanations
specific symbols or mathematical notation used in equations
New Auto-Interp
Negative Logits
flo
-0.52
fla
-0.49
fio
-0.47
Word
-0.45
</strong>
-0.44
dise
-0.44
zin
-0.44
nextLine
-0.44
Tham
-0.44
AllAfrica
-0.43
POSITIVE LOGITS
Monfieur
1.20
myſelf
1.10
quæ
1.07
himſelf
1.06
purpoſe
1.05
chofe
0.98
themſelves
0.95
feroit
0.94
itſelf
0.94
ainfi
0.92
Activations Density 0.005%