Explanations
references to emissions trading and environmental policies
Negative Logits
Token     Logit
Salad     -0.16
ergus     -0.16
Sal       -0.15
upil      -0.14
amoto     -0.14
aland     -0.14
lust      -0.14
salad     -0.13
ompson    -0.13
тин       -0.13
Positive Logits
Token     Logit
Carbon    0.17
ruc       0.16
/MPL      0.16
carbon    0.16
Carbon    0.16
embar     0.15
carbon    0.15
coal      0.15
coal      0.15
ěti       0.15
Activations Density 0.034%