INDEX
Explanations
mentions of colleagues in different contexts and settings
Neuron Alignment
Index    Value    % of L₁
513      +0.11    0.4%
1870     +0.09    0.3%
1363     +0.09    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
41       +0.11       0.04
513      +0.09       0.03
1252     +0.09       0.03
Negative Logits
Manufact     -0.58
interessa    -0.55
*/           -0.53
Tilt         -0.47
seamnă       -0.47
Mst          -0.47
homePage     -0.46
MV           -0.46
']='         -0.45
Enumer       -0.44
Positive Logits
colleagues    1.11
colleague     1.06
Colleagues    0.91
coworkers     0.82
teammates     0.81
coworker      0.76
teammate      0.73
comrades      0.71
classmates    0.70
peers         0.70
Activations Density 0.100%