INDEX
Explanations
the use of quotation marks or apostrophes in the text
New Auto-Interp
Negative Logits
Xna
-0.92
ContentAlignment
-0.83
ochond
-0.78
CONS
-0.77
{}".-0.75
komp
-0.74
Cochrane
-0.73
Vidite
-0.71
Xd
-0.70
UpInside
-0.66
POSITIVE LOGITS
‚
1.28
‘
1.19
(‘
1.01
‗
0.96
’
0.96
‘
0.94
'
0.93
(‘
0.92
、『
0.89
=’
0.85
Activations Density 0.102%