INDEX
Explanations
references to political candidates and election-related terms
Negative Logits
vert       -0.14
FX         -0.14
ÙĨز        -0.14
_closure   -0.14
ÏĢη        -0.14
jos        -0.14
pons       -0.14
Tubes      -0.13
ius        -0.13
abil       -0.13
Positive Logits
Assembly   0.27
LS         0.23
assembly   0.23
Assembly   0.22
Lok        0.22
seats      0.22
reserved   0.21
fray       0.21
Reserved   0.21
reserved   0.20
Activations Density: 0.006%