INDEX
Explanations
the presence of quotation marks or apostrophes in the text
Negative Logits
ный
-0.67
tft
-0.66
Harn
-0.66
ة
-0.65
ment
-0.65
Genova
-0.64
Martens
-0.64
éraux
-0.64
Healey
-0.63
CDT
-0.63
Positive Logits
’
1.12
''
1.08
SpringBootTest
1.06
:''
0.87
(''
0.84
:
0.84
?''
0.83
menistan
0.81
isShow
0.81
Nicky
0.80
Activations Density 0.175%