INDEX
Explanations
mentions of potential risks or causes related to environmental issues
New Auto-Interp
Negative Logits
odan
-0.73
ritical
-0.62
reb
-0.61
ĪĴ
-0.61
cemic
-0.61
robe
-0.60
claimer
-0.60
tan
-0.58
ãĥ³ãĤ¸
-0.56
dit
-0.55
POSITIVE LOGITS
everywhere
1.02
concurrently
1.01
elsewhere
0.95
throughout
0.95
anywhere
0.93
across
0.85
during
0.84
intermitt
0.83
here
0.83
alongside
0.81
Activations Density 4.774%