INDEX
Explanations
text related to software development tools and technologies
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.17
0.6%
382
+0.17
0.6%
1937
+0.15
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1937
+0.17
0.09
478
+0.17
0.07
382
+0.15
0.06
Negative Logits
kafe
-0.90
requipa
-0.90
silikon
-0.89
provoque
-0.87
bunda
-0.87
mikrofon
-0.86
alkoh
-0.84
remonte
-0.84
karton
-0.83
balon
-0.83
POSITIVE LOGITS
intersper
1.35
unspeak
1.25
shenan
1.21
apprehen
1.20
indescri
1.17
horrend
1.15
snoopy
1.09
encomp
1.09
ineffec
1.09
unavoid
1.07
Activations Density 0.363%