INDEX
Explanations
sentences related to selection and recognition of projects and ideas
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
678
+0.09
0.3%
1652
+0.09
0.3%
319
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1652
+0.09
0.04
81
+0.09
0.02
1551
+0.07
0.04
Negative Logits
DeleteMapping
-0.68
ficit
-0.56
'\\;'
-0.53
PutMapping
-0.53
)=-\
-0.51
kmäler
-0.51
الرياضيه
-0.51
BITDA
-0.50
энциклопедия
-0.50
kmale
-0.50
POSITIVE LOGITS
impra
1.54
depic
1.50
reluct
1.44
accla
1.44
encomp
1.42
increa
1.41
affor
1.37
maneu
1.37
shenan
1.36
emphat
1.35
Activations Density 0.491%