INDEX
Explanations
terms and phrases associated with medical and technical contexts
Neuron Alignment
Index    Value    % of L₁
235      +0.13    0.8%
174      +0.13    0.7%
237      +0.11    0.6%
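
The page does not define the "% of L₁" column. One plausible reading, stated here as an assumption, is that it reports each neuron's absolute weight as a share of the L₁ norm of the feature's full weight vector. A minimal Python sketch under that assumption follows; `l1_share` and the toy vector are hypothetical and not taken from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a percentage of the vector's L1 norm.

    Assumption: "% of L1" means one entry's share of the sum of absolute
    values of all entries. The dashboard itself does not confirm this.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return 100.0 * abs(weights[index]) / l1_norm


# Toy vector with made-up values, used only to illustrate the arithmetic.
toy_weights = [0.5, -0.25, 0.25]
share_of_first = l1_share(toy_weights, 0)
```

With shares this small (under 1% each), the feature's weight would be spread across many neurons rather than concentrated in the three listed.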
Correlated Neurons
Index    P. Corr.    Cos Sim.
313      +0.13       0.05
174      +0.13       0.13
237      +0.11       0.12
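
The two columns above can disagree (neuron 313 has the highest correlation but the lowest cosine similarity). Assuming "P. Corr." abbreviates Pearson correlation and "Cos Sim." cosine similarity, the sketch below shows how the two measures relate: Pearson correlation is the cosine similarity of the mean-centered vectors. The page does not say which vectors are compared (activations or weights), so the function names and inputs here are illustrative only.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation, computed as cosine similarity after centering."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)
```

Because centering removes each vector's mean, the two measures coincide only when both vectors already have zero mean.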
Negative Logits
promptly      -1.48
matt          -1.47
mains         -1.43
Instr         -1.40
rame          -1.36
magazines     -1.36
soon          -1.33
blogger       -1.32
afterwards    -1.31
IU            -1.31
Positive Logits
aud           1.58
',            1.49
uther         1.49
iment         1.43
ytes          1.42
ULAR          1.38
alth          1.35
**            1.34
erculosis     1.34
\'            1.34
Activations Density 4.655%
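
The page does not say how the density figure is computed. A common convention, assumed here, is the percentage of sampled tokens on which the feature's activation exceeds zero. The sketch below illustrates that convention; `activation_density`, its `threshold` parameter, and the sample values are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Percentage of activations strictly above `threshold`.

    Assumption: density counts tokens where the feature fires at all;
    the dashboard may use a different threshold or sampling scheme.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Made-up sample: the feature fires on one of four tokens.
sample_density = activation_density([0.0, 0.0, 0.0, 1.5])
```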