INDEX
Explanations
references to lobbying and lobbyists
New Auto-Interp
Negative Logits
ume
-0.16
oader
-0.15
Templ
-0.15
ooke
-0.15
ỡ
-0.15
bucks
-0.15
Howell
-0.15
iber
-0.14
853
-0.14
ekyll
-0.13
POSITIVE LOGITS
antenn
0.15
åĹ
0.15
atar
0.15
309
0.14
ARENT
0.14
665
0.14
staw
0.14
sten
0.14
rl
0.14
oru
0.14
Activations Density 0.009%