INDEX
Explanations
references to climate change and related terms
mentions of climate change
New Auto-Interp
Negative Logits
Niet
-0.82
ioned
-0.82
ments
-0.77
umbn
-0.75
Caldwell
-0.70
dos
-0.68
phabet
-0.67
ittle
-0.66
imity
-0.66
ery
-0.66
POSITIVE LOGITS
change
1.35
Change
1.17
change
1.10
Change
1.02
catastrophe
0.99
warming
0.96
sensitivity
0.93
extremes
0.93
science
0.91
scept
0.89
Activations Density 0.030%