INDEX
Explanations
exclamations or expressions of strong emphasis
intensifiers and exclamatory expressions
New Auto-Interp
Negative Logits
aft
-0.76
onial
-0.72
ministic
-0.69
xia
-0.67
senses
-0.65
edom
-0.63
ramid
-0.63
uv
-0.63
EStreamFrame
-0.63
ongyang
-0.63
POSITIVE LOGITS
heck
1.40
louder
0.80
bats
0.78
Heck
0.78
HELL
0.75
chuck
0.75
Jagu
0.74
Pengu
0.74
thous
0.70
storms
0.70
Activations Density 0.005%