INDEX
Explanations
exclamations expressing surprise or strong emotion
expressions of surprise or exclamation
Negative Logits
20439         -0.72
-+-+          -0.66
":[{"         -0.66
resso         -0.65
CTR           -0.62
iband         -0.61
Introduced    -0.61
*/(           -0.60
perature      -0.59
ngth          -0.59
Positive Logits
yeah          1.23
hhhh          1.18
hhh           1.12
hh            1.12
yea           1.05
yeah          1.03
dear          1.01
hey           0.99
yes           0.93
wow           0.93
Activations Density 0.022%