INDEX
Explanations
the presence of numerical values or specific count-related syntax in the text
New Auto-Interp
Negative Logits
ULT
-0.51
тинг
-0.51
divine
-0.50
zare
-0.49
ダイ
-0.49
cru
-0.49
Live
-0.47
Lázaro
-0.46
сай
-0.46
poved
-0.45
POSITIVE LOGITS
__':
1.10
__":
0.95
ViewFeatures
0.85
مشين
0.84
:✨
0.83
kaarangay
0.78
nakalista
0.78
سكانية
0.78
出版年
0.77
ViewImports
0.77
Activations Density 0.089%