INDEX
Explanations
references to specific ordinances or legal regulations
Negative Logits
ively      -0.18
eyer       -0.16
keleton    -0.16
asty       -0.15
eck        -0.15
istics     -0.15
aty        -0.14
ipp        -0.14
een        -0.14
arken      -0.14
Positive Logits
inance     0.30
inals      0.28
ination    0.27
entlich    0.27
inary      0.26
INARY      0.26
nung       0.26
oliberal   0.25
ained      0.24
inate      0.23
Activations Density: 0.007%