INDEX
Explanations
medical and health-related information, instructions or recommendations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
381
+0.25
0.8%
1510
+0.13
0.4%
569
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.25
0.06
1510
+0.13
0.05
478
+0.10
0.05
Negative Logits
quoc
-0.82
monaster
-0.82
venuto
-0.76
churrasco
-0.75
persil
-0.73
churras
-0.71
barbacoa
-0.66
habang
-0.66
tortas
-0.66
nguyen
-0.65
POSITIVE LOGITS
also
1.44
also
1.27
Also
1.18
Also
1.18
furthermore
1.14
também
1.01
moreover
1.00
ALSO
0.96
также
0.95
inoltre
0.93
Activations Density 0.468%