INDEX
Explanations
punctuation used in dialogue or quotations
Negative Logits
CLK             -0.96
GenerationType  -0.81
?>">            -0.79
Schro           -0.76
Paro            -0.76
@@@@@@@@        -0.75
Moos            -0.75
Maru            -0.74
Babylon         -0.74
Valer           -0.74
Positive Logits
,&      0.80
        0.80
\,\     0.79
,,      0.78
,"      0.78
,’      0.76
,'      0.76
,\      0.76
,,,     0.75
gments  0.75
Activations Density 0.111%