INDEX
Explanations
connecting phrases using the word 'and'
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1921
+0.10
0.3%
1350
+0.10
0.3%
1892
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1921
+0.10
0.06
1892
+0.10
0.05
7
+0.10
0.04
Negative Logits
Simult
-0.71
Derivation
-0.70
alip
-0.70
JAKARTA
-0.69
Effec
-0.68
Applicability
-0.68
Fixation
-0.64
Adieu
-0.64
Utilisation
-0.64
Descrip
-0.62
POSITIVE LOGITS
ConstraintMaker
0.53
WebElementEntity
0.51
fflush
0.50
quién
0.49
underval
0.48
invokingState
0.47
ProtoMessage
0.46
AND
0.46
ilever
0.45
hombres
0.44
Activations Density 0.224%