INDEX
Explanations
phrases related to cause and effect
terms related to ecological impact and interconnectedness
New Auto-Interp
Negative Logits
eral
-1.04
swick
-0.89
rique
-0.85
ERAL
-0.85
rano
-0.83
nery
-0.82
rar
-0.81
fighter
-0.81
seless
-0.80
imen
-0.76
POSITIVE LOGITS
effect
1.01
Effect
0.86
effects
0.85
ripple
0.79
cascade
0.79
Effects
0.79
ecosystem
0.78
casc
0.77
sclerosis
0.76
downstream
0.75
Activations Density 0.147%