Explanations
the word "verb" in sentences
Neuron Alignment
Index    Value    % of L₁
1137     +0.10    0.3%
1983     +0.08    0.2%
1654     +0.07    0.2%
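The page does not define the "% of L₁" column. One plausible reading is that it reports each neuron's share of the L₁ norm (sum of absolute values) of the feature's weight vector. The sketch below illustrates that reading only; the helper name `top_aligned` and the toy vector are hypothetical and are not taken from the dashboard.

```python
def top_aligned(weights, k=3):
    """Return (index, value, share_of_l1) for the k largest-magnitude entries.

    Assumption: "% of L1" means abs(value) divided by the sum of absolute
    values over the whole vector. The source page does not confirm this.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        return []
    # Rank positions by absolute weight, largest first.
    ranked = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    return [(i, weights[i], abs(weights[i]) / l1_norm) for i in ranked[:k]]


# Toy vector for illustration only (not the real feature weights).
toy_weights = [0.5, -0.25, 0.125, 0.125]
for index, value, share in top_aligned(toy_weights, k=2):
    print(f"{index}\t{value:+.2f}\t{share:.1%}")
```

Under this reading, shares as small as 0.2% to 0.3% for the top neurons would mean the feature's weight is spread thinly across many neurons rather than concentrated in a few.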
Correlated Neurons
Index    P. Corr.    Cos Sim.
1363     +0.10       0.04
1896     +0.08       0.02
783      +0.07       0.03
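"P. Corr." presumably abbreviates Pearson correlation, and the page does not say which vectors the two columns compare. As a rough illustration of why the two columns can disagree, the sketch below assumes two equal-length activation series and computes both measures. Pearson correlation is cosine similarity applied after subtracting each series' mean. The function names and sample data are hypothetical.

```python
import math


def cosine_similarity(x, y):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0 or norm_y == 0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_x * norm_y)


def pearson_correlation(x, y):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine_similarity([a - mean_x for a in x], [b - mean_y for b in y])


# Made-up activation series (assumption: one value per token).
feature_acts = [1.0, 2.0, 3.0]
neuron_acts = [3.0, 2.0, 1.0]
print(pearson_correlation(feature_acts, neuron_acts))
print(cosine_similarity(feature_acts, neuron_acts))
```

Because only Pearson correlation removes the mean, two non-negative series can have a positive cosine similarity while being negatively correlated, as in the sample data.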
Negative Logits
balas            -0.66
monta            -0.61
mals             -0.61
elit             -0.60
malu             -0.59
contentLoaded    -0.57
KY               -0.57
bolista          -0.57
rada             -0.55
Tur              -0.54
Positive Logits
verb      1.93
verbs     1.66
Verbs     1.54
noun      1.40
verb      1.36
ecru      1.31
appunt    1.31
Verb      1.30
Verb      1.28
affez     1.22
Activations Density 0.301%