INDEX
Explanations
mentions of the artist Lady Gaga
references to Lady Gaga
New Auto-Interp
Negative Logits
kson
-0.80
aeda
-0.78
osta
-0.74
iary
-0.66
venants
-0.66
DX
-0.66
ologically
-0.65
emp
-0.65
iku
-0.64
icted
-0.64
POSITIVE LOGITS
Gaga
1.34
bug
1.14
bird
1.05
bugs
1.05
maid
0.97
cup
0.85
birds
0.84
fing
0.79
folk
0.79
Diana
0.78
Activations Density 0.025%