INDEX
Explanations
numeric values or references to measurements and scientific data
Numbers within or after references
citations and numbers
Negative Logits
ⓧ	-0.98
للمعارف	-0.96
tvguidetime	-0.84
createState	-0.83
GoogleFonts	-0.82
autorytatywna	-0.82
awtextra	-0.81
كومونز	-0.80
***!	-0.78
resourceCulture	-0.78
Positive Logits
0.57
3	0.56
<eos>	0.51
2	0.51
7	0.50
9	0.50
5	0.50
1	0.50
4	0.47
8	0.46
Activations Density 0.112%