INDEX
Explanations
phrases related to conditions or situations of need or assistance
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1438
+0.15
0.5%
897
+0.13
0.4%
1515
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1515
+0.15
0.05
1438
+0.13
0.05
101
+0.12
0.05
Negative Logits
pymysql
-0.75
Più
-0.62
Queste
-0.61
Vedi
-0.61
Infatti
-0.60
Scopri
-0.60
cæ
-0.60
Cfr
-0.59
Informações
-0.59
Sì
-0.57
POSITIVE LOGITS
when
0.99
when
0.92
WHEN
0.86
When
0.76
WHEN
0.74
When
0.71
quando
0.68
cuando
0.66
whenever
0.66
they
0.63
Activations Density 0.160%