INDEX
Explanations
time-related phrases and references
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
98
+0.16
0.9%
64
+0.11
0.6%
430
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
98
+0.16
0.00
135
+0.11
0.02
64
+0.11
0.02
Negative Logits
ĻĤ
-1.83
naire
-1.54
Ļ
-1.52
anchor
-1.45
oxidase
-1.45
gered
-1.44
OTO
-1.43
ħ
-1.43
%%
-1.39
indicators
-1.36
POSITIVE LOGITS
vicinity
1.65
Sessions
1.59
cliffe
1.45
hers
1.42
field
1.39
oft
1.37
combe
1.36
dale
1.34
ford
1.33
>'
1.33
Activations Density 0.284%