INDEX
Explanations
instances of the exclamation "Ha" indicating laughter or surprise, often in various stylings or repetitions
New Auto-Interp
Negative Logits
lec
-0.16
pong
-0.15
cross
-0.15
stroy
-0.15
212
-0.14
Wid
-0.14
h
-0.14
jets
-0.14
horn
-0.14
wid
-0.14
POSITIVE LOGITS
unted
0.24
ha
0.24
iku
0.23
Ha
0.22
Ha
0.21
ifax
0.21
ifa
0.20
unting
0.20
iley
0.19
emat
0.18
Activations Density 0.011%