INDEX
Explanations
instances of past tense verbs in the context of events or actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
56
+0.13
0.7%
58
+0.12
0.7%
421
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
69
+0.13
0.35
151
+0.12
0.26
98
+0.11
0.11
Negative Logits
thereon
-1.49
primes
-1.40
showers
-1.39
weddings
-1.39
baths
-1.39
soever
-1.37
streams
-1.35
expects
-1.33
crowds
-1.29
teeth
-1.28
POSITIVE LOGITS
uve
1.45
itage
1.45
documentclass
1.43
jour
1.42
aka
1.41
uk
1.41
nick
1.39
gard
1.37
leaf
1.36
jun
1.36
Activations Density 2.302%