Explanations
references to ordinances or legislative measures
Negative Logits
enko      -0.16
een       -0.16
eyer      -0.16
argout    -0.16
erior     -0.16
evice     -0.16
ively     -0.15
asty      -0.15
istics    -0.14
etí       -0.14
Positive Logits
inance    0.32
ained     0.28
ination   0.26
inary     0.26
oliberal  0.25
INARY     0.25
inals     0.24
nung      0.23
entlich   0.23
inate     0.22
Activations Density 0.007%