INDEX
Explanations
references to specific operatic works and their characters
Negative Logits
overd          -0.17
_ORIGIN        -0.16
ìĹĦ            -0.15
Framework      -0.15
velt           -0.15
vik            -0.15
Pew            -0.15
quil           -0.14
aftermarket    -0.14
ìľ¡            -0.14
Positive Logits
Tos            0.23
Fal            0.21
Norm           0.19
Nab            0.19
Boris          0.18
Ot             0.18
Fal            0.18
Masc           0.17
Carmen         0.17
Ring           0.17
Activations Density 0.010%