INDEX
Explanations
direct quotes starting with the word "Don"
Neuron Alignment
Index   Value   % of L₁
1604    +0.13   0.5%
1335    +0.13   0.5%
812     +0.11   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1604    +0.13      0.04
1335    +0.13      0.05
812     +0.11      0.03
Negative Logits
saites          -0.49
HasForeignKey   -0.49
ibatis          -0.49
reactivex       -0.47
chartInstance   -0.46
囗              -0.46
clable          -0.42
розта           -0.42
Население       -0.42
виді            -0.42
Positive Logits
désol      0.98
disagre    0.92
DON        0.91
effray     0.91
Don        0.89
malheure   0.88
Don        0.86
encomp     0.85
osal       0.85
unwarran   0.84
Activations Density: 0.091%