INDEX
Explanations
references to legislative processes and government officials
Negative Logits
emouth  -0.16
âĶĶ  -0.15
ehir  -0.15
çIJ  -0.14
ifter  -0.14
anni  -0.14
anders  -0.14
ĮĢ  -0.13
æŀIJ  -0.13
anmeld  -0.13
Positive Logits
bottoms  0.16
ocale  0.15
ulet  0.15
Fro  0.15
lename  0.14
iais  0.14
patterns  0.13
acker  0.13
è¿ij  0.13
============================================================================↵  0.13
Activations Density: 0.053%