INDEX
Explanations
references to musical artists and their works
New Auto-Interp
Negative Logits
007
-0.16
athed
-0.15
åľ¨çº¿éĺħ读
-0.15
oser
-0.15
FRING
-0.15
ATEGORY
-0.14
anka
-0.14
Booker
-0.14
αιν
-0.14
350
-0.14
POSITIVE LOGITS
idols
0.21
idol
0.20
member
0.19
nuest
0.19
MV
0.17
agency
0.17
members
0.17
Idol
0.17
rookies
0.17
.member
0.17
Activations Density 0.050%