INDEX
Explanations
expressions of amazement or admiration
New Auto-Interp
Negative Logits
SequentialGroup
-0.54
no
-0.54
(
-0.53
contrario
-0.51
Old
-0.51
vodu
-0.51
そろそろ
-0.49
et
-0.49
子上
-0.49
Руси
-0.49
POSITIVE LOGITS
amazing
1.41
amazing
1.34
Amazing
1.32
feats
1.27
Amazing
1.25
AMAZING
1.24
Impressive
1.22
Spectacular
1.22
Stunning
1.21
Incredible
1.19
Activations Density 0.205%