INDEX
Explanations
the word "to" in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
19
+0.13
0.7%
346
+0.13
0.7%
414
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
69
+0.13
0.38
151
+0.13
0.31
434
+0.12
0.30
Negative Logits
retrospect
-1.34
psychiat
-1.33
orms
-1.32
my
-1.29
GIS
-1.26
Äį
-1.23
logging
-1.22
romycin
-1.20
normalized
-1.20
depression
-1.20
POSITIVE LOGITS
»¿
2.03
ĻĤ
1.92
ĨĴ
1.70
ĩ
1.67
inel
1.66
½
1.65
therefrom
1.62
Ĩ
1.61
ŀ
1.50
©
1.49
Activations Density 1.625%