INDEX
Explanations
mentions of politicians, specifically focusing on the name patterns similar to "Biden" and "Clinton"
names related to political figures or identities
New Auto-Interp
Negative Logits
exha
-0.89
vati
-0.77
rul
-0.75
pmwiki
-0.75
weeds
-0.74
«ĺ
-0.73
murd
-0.73
thora
-0.72
ILCS
-0.72
ãħĭ
-0.71
POSITIVE LOGITS
iden
1.23
ovo
0.97
vier
0.92
ners
0.91
ception
0.89
unci
0.88
heimer
0.85
fold
0.84
vironment
0.83
ning
0.82
Activations Density 0.013%