Explanations
phrases related to practice, preparation, and dedication
Neuron Alignment
Index   Value   % of L₁
1222    +0.16   0.6%
795     +0.14   0.5%
486     +0.13   0.5%
Correlated Neurons
Index   Pearson Corr.   Cos. Sim.
1222    +0.16           0.03
795     +0.14           0.02
1562    +0.13           0.02
Negative Logits
Token       Logit
nadat       -0.45
afgelopen   -0.45
rası        -0.44
gesteld     -0.44
juegan      -0.44
felici      -0.44
hyrchwyd    -0.42
palk        -0.42
حياته       -0.42
кульп       -0.42
Positive Logits
Token       Logit
Van         1.39
Van         1.32
VAN         1.31
van         1.20
VAN         1.10
van         1.08
vans        0.97
Vans        0.96
répon       0.95
exé         0.83
Activations Density: 0.062%