INDEX
Explanations
words related to the processing of information or materials
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1145
+0.15
0.5%
553
+0.14
0.5%
1334
+0.14
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
789
+0.15
0.03
553
+0.14
0.03
765
+0.14
0.02
Negative Logits
reele
-0.51
Köl
-0.45
vinos
-0.45
bolj
-0.43
cristi
-0.43
Uniti
-0.43
zeczytaj
-0.43
najbolj
-0.43
poiché
-0.42
gentes
-0.42
POSITIVE LOGITS
processing
1.17
Processing
1.12
processed
1.09
PROCESSING
1.08
processors
1.07
processing
1.07
Processed
1.05
Processing
1.03
processor
1.03
processed
1.02
Activations Density 0.049%