INDEX
Explanations
technical terms related to critical thinking and analysis
Neuron Alignment
Index   Value   % of L₁
50      +0.15   0.9%
874     +0.14   0.9%
1974    +0.13   0.8%
Correlated Neurons
Index   P. Corr.   Cos Sim.
874     +0.15      0.03
1974    +0.14      0.03
239     +0.13      0.03
Negative Logits
Token         Logit
<bos>         -3.19
/***          -0.77
(not shown)   -0.72
//{           -0.68
///**         -0.67
<?            -0.66
/*++          -0.63
//*/          -0.59
introduce     -0.59
assistir      -0.59
Positive Logits
Token      Logit
Critical   1.10
Minang     1.09
critical   1.07
grossa     1.02
Critical   1.02
critical   1.01
corrom     0.99
jaya       0.98
parati     0.97
lele       0.97
Activations Density: 0.097%