INDEX
Explanations
instances of quotation marks and dialogue in the text
opening quote marks
New Auto-Interp
Negative Logits
propOrder
-1.00
queſta
-0.88
ロウィン
-0.84
ujednoznacz
-0.82
bootstrapcdn
-0.81
ſelben
-0.80
müſſen
-0.79
مرئيه
-0.79
<unused41>
-0.78
<unused68>
-0.78
POSITIVE LOGITS
“
0.85
"
0.71
"
0.69
("0.59
The
0.56
'
0.56
“
0.56
‘
0.54
If
0.53
「
0.53
Activations Density 0.002%