Explanations
questions related to job expectations and responsibilities
Neuron Alignment
Index   Value   % of L₁
1978    +0.14   0.6%
776     +0.13   0.5%
1984    +0.13   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
776     +0.14      0.06
892     +0.13      0.05
1678    +0.13      0.04
Negative Logits
Token            Logit
Música           -0.52
toContain        -0.50
cuadro           -0.50
asteroide        -0.48
βο               -0.47
Após             -0.47
ResponseEntity   -0.47
osoba            -0.47
Să               -0.47
addCriterion     -0.47
Positive Logits
Token     Logit
nomine    1.03
sopr      0.99
NINE      0.99
sappi     0.96
coq       0.93
milano    0.91
bourgeo   0.90
jorge     0.89
Cfr       0.88
peppa     0.87
Activations Density 0.149%