INDEX
Explanations
references to legislative actions or propositions
New Auto-Interp
Negative Logits
ering
-0.18
phy
-0.17
ane
-0.17
ered
-0.16
ãĤĥ
-0.16
er
-0.15
ritt
-0.15
rane
-0.14
rupt
-0.14
å¯Ĩ
-0.14
POSITIVE LOGITS
rieve
0.25
atri
0.23
lica
0.23
ulsive
0.22
resent
0.22
uls
0.20
lication
0.20
rep
0.20
licate
0.20
udi
0.19
Activations Density 0.017%