INDEX
Explanations
references to the abbreviation "AC" followed by a number, potentially related to a specific context or topic
references to the band AC/DC
New Auto-Interp
Negative Logits
ozyg
-0.85
bush
-0.83
hood
-0.77
glers
-0.74
jar
-0.72
doms
-0.69
axy
-0.65
lihood
-0.62
chens
-0.62
Fenrir
-0.62
POSITIVE LOGITS
CEPT
1.04
oustic
1.04
ORN
0.94
rylic
0.88
oust
0.87
APTER
0.84
KN
0.84
ritical
0.84
SI
0.84
HE
0.83
Activations Density 0.025%