INDEX
Explanations
mentions of the name "Sarkozy" and related references in political contexts
New Auto-Interp
Negative Logits
âĸ¬âĸ¬
-0.75
66666666
-0.73
ãĥĩãĤ£
-0.69
Universal
-0.66
ters
-0.65
LEASE
-0.65
VAT
-0.62
tered
-0.62
neurot
-0.61
yright
-0.61
POSITIVE LOGITS
ozy
1.56
olini
0.96
iller
0.93
anski
0.93
esian
0.93
indal
0.89
inia
0.88
mire
0.88
aran
0.87
arin
0.85
Activations Density 0.004%