INDEX
Explanations
environmental concerns and potential disasters
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
752
+0.16
0.5%
1842
+0.11
0.3%
690
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
802
+0.16
0.04
194
+0.11
0.01
1499
+0.11
0.04
Negative Logits
tantôt
-0.61
Darío
-0.59
onaldo
-0.59
latego
-0.58
McLaugh
-0.57
Pued
-0.56
intitu
-0.55
posób
-0.55
logitech
-0.55
réaliste
-0.55
POSITIVE LOGITS
ecosystem
0.76
ecosystems
0.75
thut
0.73
effe
0.67
biodiversity
0.65
foon
0.63
aen
0.62
ftu
0.61
nece
0.61
totalCount
0.60
Activations Density 0.334%