INDEX
Explanations
adverbs modifying verbs
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
650
+0.16
0.5%
605
+0.11
0.4%
347
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
650
+0.16
0.06
1575
+0.11
0.04
1758
+0.09
0.04
Negative Logits
Sav
-0.47
Guid
-0.44
Deriv
-0.43
OnDelete
-0.42
Activator
-0.42
Reve
-0.42
CommandType
-0.41
Org
-0.41
Gregory
-0.41
Gonz
-0.41
POSITIVE LOGITS
affez
1.03
preghi
0.91
thermomix
0.86
santana
0.86
stihl
0.85
sappi
0.85
soggior
0.84
chery
0.83
bauer
0.82
pecuni
0.82
Activations Density 0.085%