INDEX
Explanations
terms related to software licenses and permissions
Neuron Alignment
Index   Value   % of L₁
71      +0.12   0.7%
369     +0.12   0.7%
74      +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
74      +0.12      0.02
222     +0.12      0.02
86      +0.11      0.02
Negative Logits
ľĵ        -2.35
Ĺ         -1.86
purpose   -1.62
ÑĢоÑģ     -1.57
purpose   -1.57
stood     -1.55
Į         -1.50
opsies    -1.50
Ł         -1.50
>",       -1.48
Positive Logits
uelle        1.61
umab         1.50
itious       1.48
religion     1.47
aceous       1.47
den          1.46
bery         1.45
holm         1.42
yourselves   1.39
ner          1.38
Activations Density 0.197%