INDEX
Explanations
expressions of surprise or excitement
Interjections or affirmations
expressive sounds and interjections
Negative Logits
Мексичка    -1.14
itſelf      -1.05
Efq         -1.04
)");        -1.02
)";         -1.02
iſt         -1.02
―――――       -1.01
$")         -1.00
neſs        -1.00
فريبيس      -0.97
Positive Logits
!           0.79
I           0.77
<eos>       0.68
you         0.63
…           0.63
freakin     0.62
freaking    0.62
!!!         0.60
Oh          0.58
yeah        0.57
Activations Density 0.176%