INDEX
Explanations
phrases related to career development
Neuron Alignment
Index   Value   % of L₁
198     +0.09   0.3%
405     +0.09   0.3%
674     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
47      +0.09      0.04
405     +0.09      0.03
2045    +0.09      0.04
Negative Logits
Token            Logit
FontOfSize       -0.50
instead          -0.45
was              -0.45
became           -0.44
suddenly         -0.43
anyone           -0.43
KEYCODE          -0.42
wasn             -0.42
InputTagHelper   -0.41
Anyone           -0.41
Positive Logits
Token     Logit
<bos>     1.14
nutella   0.98
doman     0.97
affez     0.95
milano    0.95
sappi     0.94
soggior   0.94
territo   0.89
napoli    0.89
ristor    0.86
Activations Density 0.119%