INDEX
Explanations
dialogue punctuated by exclamations and questions
Negative Logits
ciplinary      -0.72
ancial         -0.72
aukee          -0.71
conservancy    -0.71
otype          -0.70
utterstock     -0.70
¥ŀ             -0.68
oÄŁ            -0.67
isphere        -0.67
segreg         -0.67
Positive Logits
Suddenly       1.02
Hearing        0.94
âĢķ            0.91
ãĢĮ            0.90
murm           0.89
whispered      0.89
Said           0.88
pause          0.86
Slowly         0.84
>>\            0.84
Activations Density: 0.058%