INDEX
Explanations
mentions of climate change
the topic of climate change
Negative Logits
ioned      -0.77
IME        -0.77
dos        -0.76
ittal      -0.76
ments      -0.76
Niet       -0.75
wered      -0.74
amina      -0.73
Caldwell   -0.71
onen       -0.71
Positive Logits
change        1.07
Change        1.02
warming       0.98
Armageddon    0.91
climate       0.90
catastrophe   0.90
extremes      0.88
change        0.87
sensitivity   0.87
dioxide       0.86
Activations Density: 0.039%