INDEX
Explanations
features and functions related to coding and programming languages
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
876
+0.25
0.8%
872
+0.12
0.4%
2033
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2044
+0.25
0.06
876
+0.12
-0.01
872
+0.09
0.06
Negative Logits
Demokrat
-0.67
'\\;'
-0.66
Joko
-0.64
Gnaden
-0.62
akut
-0.60
Fulda
-0.59
Widodo
-0.58
+#+#
-0.58
Duisburg
-0.58
Öster
-0.58
POSITIVE LOGITS
tupperware
1.11
embodi
1.09
impractica
1.07
peppa
1.04
ROBER
0.99
hairc
0.97
jacques
0.97
gsx
0.95
vété
0.94
poff
0.94
Activations Density 0.449%