INDEX
Explanations
punctuation marks that signify dialogue or quoted speech
Negative Logits
ècie            -0.67
Sex             -0.65
estima          -0.64
PL              -0.64
GeneratedCode   -0.63
плек            -0.62
ThemeOverlay    -0.62
Fle             -0.60
drivers         -0.60
Ro              -0.58
Positive Logits
?"              1.09
?”              1.07
??"             1.00
,”              0.99
,"              0.96
,’”             0.96
?!"             0.94
.'"             0.92
."              0.90
?!”             0.90
Activations Density 1.028%