INDEX
Explanations
mentions of emissions or related terms
references to emissions and their impacts on the environment
New Auto-Interp
Negative Logits
Else
-0.93
ivas
-0.81
inosaur
-0.75
ISH
-0.74
anova
-0.73
rooms
-0.72
slice
-0.71
ciating
-0.68
Else
-0.68
amina
-0.68
POSITIVE LOGITS
emissions
1.35
emission
1.14
emitting
1.04
emit
0.95
dioxide
0.93
pollution
0.93
gases
0.92
emitted
0.90
emits
0.87
pollutants
0.84
Activations Density 0.026%