Explanations
references to specific objects or entities, possibly in a mathematical or programming context
Negative Logits
Theſe      -0.99
Beſ        -0.86
་་         -0.84
myſelf     -0.82
himſelf    -0.78
itſelf     -0.75
faſt       -0.74
neſs       -0.72
Monfieur   -0.72
viſ        -0.72
Positive Logits
O          1.61
o          1.45
O          1.44
oocytes    1.18
o          1.05
afone      1.03
nO         1.01
cO         0.98
О          0.90
E          0.89
Activations Density 0.091%