INDEX
Explanations
mentions of the word "fossil" or "fossil fuels."
references to fossil fuels
New Auto-Interp
Negative Logits
arta
-0.85
annis
-0.83
yang
-0.76
thel
-0.76
amber
-0.74
ansky
-0.73
urat
-0.72
ournal
-0.71
arter
-0.70
Norton
-0.70
POSITIVE LOGITS
fuels
1.13
fuel
1.03
fossil
0.94
ized
0.88
Foss
0.88
fuel
0.85
footprints
0.84
itions
0.83
ization
0.80
ifer
0.78
Activations Density 0.011%