INDEX
Explanations
terms related to environmental policies and energy production
New Auto-Interp
Negative Logits
wap
-0.16
Cout
-0.15
uess
-0.15
ATTER
-0.14
drains
-0.14
_marshall
-0.14
ngine
-0.14
jenter
-0.14
.synthetic
-0.14
rain
-0.14
POSITIVE LOGITS
rate
0.34
rate
0.30
Rate
0.28
Rate
0.27
-rate
0.25
rates
0.25
_rate
0.24
RATE
0.24
retail
0.23
rat
0.23
Activations Density 0.044%