Explanations
phrases related to teamwork and collaboration
Neuron Alignment
Index    Value    % of L₁
1413     +0.13    0.5%
442      +0.12    0.4%
650      +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
442      +0.13       0.03
1413     +0.12       0.02
650      +0.12       0.02
Negative Logits
ideolog    -0.68
solidar    -0.67
Fara       -0.55
Víctor     -0.55
Lucía      -0.55
konkre     -0.55
Mónica     -0.54
Gost       -0.53
Áng        -0.52
Héctor     -0.52
Positive Logits
combined     0.75
combine      0.75
combined     0.73
Combined     0.69
COMBIN       0.69
Combined     0.68
combines     0.68
combin       0.66
combining    0.66
combinado    0.65
Activations Density: 0.068%