INDEX
Explanations
quoted speech or dialogue
opening quotation marks
New Auto-Interp
Negative Logits
rungsseite
-0.90
ujednoznacz
-0.89
ロウィン
-0.70
BibitemShut
-0.66
careous
-0.64
<unused41>
-0.63
ſſung
-0.63
<unused16>
-0.63
<unused8>
-0.63
[@BOS@]
-0.63
POSITIVE LOGITS
“
0.88
"
0.57
„
0.42
«
0.42
‘
0.42
("0.41
<bos>
0.36
('0.36
「
0.36
"(
0.35
Activations Density 0.029%