INDEX
Explanations
phrases related to entering or joining different entities or situations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1323
+0.15
0.6%
421
+0.14
0.5%
1416
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1323
+0.15
0.04
1416
+0.14
0.03
1526
+0.11
0.03
Negative Logits
kwds
-0.54
ciclop
-0.51
XIM
-0.50
agosto
-0.49
AddWithValue
-0.47
MAXN
-0.47
lês
-0.46
DispatchToProps
-0.45
месте
-0.45
dc
-0.45
POSITIVE LOGITS
shenan
1.32
intersper
1.31
Entered
1.20
scrat
1.19
Entering
1.15
hairc
1.15
depic
1.14
casio
1.13
snoopy
1.13
hentai
1.12
Activations Density 0.112%