INDEX
Explanations
conversational elements and dialogue markers in spoken exchanges
Negative Logits
فريبيس    -1.11
'\\;'    -0.82
featureID    -0.74
aarrggbb    -0.74
himſelf    -0.71
itſelf    -0.71
ſy    -0.71
ſelves    -0.69
BRARY    -0.69
RenderAtEndOf    -0.68
Positive Logits
Yeah    0.67
Yes    0.65
Yeah    0.62
yes    0.62
Yes    0.59
Oh    0.59
yeah    0.57
voyez    0.55
Absolutely    0.54
Oh    0.54
Activations Density: 0.060%