INDEX
Explanations
references to the musician "Bonnie" or mentions of his related activities
New Auto-Interp
Negative Logits
dfx
-0.85
UGE
-0.72
INAL
-0.67
ELD
-0.66
TY
-0.64
ãĥĥãĥī
-0.64
REAM
-0.63
gamer
-0.61
Ethics
-0.60
æĸ¹
-0.59
POSITIVE LOGITS
anza
1.17
uses
1.12
iton
1.10
itors
1.06
eless
1.02
anz
1.00
obo
0.96
obos
0.96
gey
0.95
neau
0.94
Activations Density 0.016%