INDEX
Explanations
punctuation marks, particularly quotation marks and other symbols indicating speech or quotation
New Auto-Interp
Negative Logits
İstinadlar
-1.16
Maynard
-0.86
ngua
-0.81
DDG
-0.80
findpost
-0.79
hermosa
-0.79
]=="
-0.78
Horv
-0.77
ंदीखरीदारी
-0.77
ibouti
-0.77
POSITIVE LOGITS
(«
1.24
«
1.16
««
1.16
«
1.14
.«
1.14
?«
1.06
,»
1.06
?»
1.05
)»
1.02
,«
0.97
Activations Density 0.018%