INDEX
Explanations
key events, shows, or performances mentioned in the text
Negative Logits
yon          -0.15
otu          -0.14
gradation    -0.13
ekli         -0.13
asso         -0.13
vrd          -0.13
олж          -0.13
ksi          -0.13
atern        -0.13
fad          -0.13
Positive Logits
features     0.93
feature      0.90
features     0.81
Features     0.81
feature      0.77
Features     0.74
Feature      0.74
-feature     0.72
Feature      0.71
FEATURES     0.68
Activations Density 0.203%