INDEX
Explanations
the beginning of a document or new section
New Auto-Interp
Negative Logits
disambiguazione
-0.92
########.
-0.89
MenuView
-0.89
IsContent
-0.85
دانشنامهٔ
-0.83
itſelf
-0.82
Abitanti
-0.82
themſelves
-0.79
windowFixed
-0.78
Monfieur
-0.78
POSITIVE LOGITS
L
0.55
i
0.54
I
0.53
J
0.52
ๆ
0.52
ਮ
0.51
يتيمه
0.50
A
0.50
지
0.50
spunkt
0.49
Activations Density 0.024%