INDEX
Explanations
exclamations expressing strong emotions or frustrations
expressions of frustration or disbelief
Negative Logits
ufact           -0.76
083             -0.76
iku             -0.72
Vert            -0.69
cit             -0.68
Ĭ±              -0.67
Joy             -0.64
士              -0.63
EStreamFrame    -0.62
hig             -0.59
Positive Logits
happened        0.79
else            0.72
?!              0.72
!?"             0.68
holes           0.67
dude            0.66
!?              0.66
else            0.65
dar             0.64
rubbish         0.64
Activations Density: 0.021%