INDEX
Explanations
references to structured or mathematical notations in scientific text
New Auto-Interp
Negative Logits
itſelf
-0.92
himſelf
-0.88
للمعارف
-0.88
Theſe
-0.88
myſelf
-0.85
themſelves
-0.84
auffi
-0.82
Monfieur
-0.82
resourceCulture
-0.81
pleaſure
-0.81
POSITIVE LOGITS
guchi
0.71
gdx
0.65
co
0.59
Wilber
0.57
figure
0.56
overset
0.56
cal
0.56
La
0.56
la
0.56
La
0.55
Activations Density 0.071%