INDEX
Explanations
medical terms and research-related keywords
Neuron Alignment
Index   Value   % of L₁
156     +0.15   0.8%
577     +0.14   0.7%
568     +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
577     +0.15      0.05
568     +0.14      0.06
227     +0.11      0.06
Negative Logits
<bos>          -1.86
ⓧ              -0.64
مرئيه          -0.62
chengladbach   -0.60
springfox      -0.60
znaleźć        -0.58
warran         -0.56
sikkert        -0.56
Și             -0.56
puțin          -0.53
Positive Logits
utop       1.10
umbro      0.98
quoique    0.98
uefa       0.96
marte      0.95
riviera    0.94
Ub         0.93
Græ        0.92
eiffel     0.92
Ueb        0.90
Activations Density 0.471%