INDEX
Explanations
occurrences of quotation marks in the text
Negative Logits
Token       Logit
ajur        -0.75
aData       -0.69
fleisch     -0.69
Fergus      -0.68
likle       -0.67
的很        -0.65
Ortiz       -0.64
Percival    -0.64
Gier        -0.62
Viitteet    -0.62
Positive Logits
Token       Logit
",          1.46
)",         1.43
?",         1.35
'",         1.28
]",         1.26
)".         1.24
}",         1.23
,",         1.21
$",         1.18
\"",        1.18
Activations Density: 0.103%