Explanations
references to dam construction and related activities
mentions of dams
Negative Logits
Token           Logit
lihood          -0.83
Americans       -0.64
Reloaded        -0.63
Prosecut        -0.62
ISTER           -0.62
Addiction       -0.62
Hawth           -0.62
Disorder        -0.61
endorsements    -0.61
ASY             -0.60
Positive Logits
Token           Logit
ascus           1.11
dam             1.08
dams            0.94
lp              0.92
ming            0.90
iets            0.85
mit             0.82
ilitation       0.82
ework           0.80
sel             0.79
Activations Density: 0.011%