INDEX
Explanations
references to specific names or identities, particularly ones related to scandals or controversies
instances of a specific name or term repeated in various contexts
New Auto-Interp
Negative Logits
gie
-0.71
ãģŁ
-0.68
FUL
-0.67
Democr
-0.66
fully
-0.66
Valent
-0.66
Buc
-0.65
ger
-0.65
swick
-0.64
fulness
-0.64
POSITIVE LOGITS
olitics
1.25
olicy
1.02
odcast
1.01
acing
0.99
etition
0.94
roximately
0.93
roxy
0.93
rison
0.92
aic
0.91
acus
0.90
Activations Density 0.027%