INDEX
Explanations
abbreviations and acronyms related to medical terms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1983
+0.08
0.3%
1735
+0.08
0.3%
50
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.08
0.05
1735
+0.08
0.03
144
+0.07
0.03
Negative Logits
<bos>
-1.09
build
-0.62
public
-0.62
export
-0.61
лык
-0.60
moved
-0.59
buồn
-0.59
consider
-0.58
ativistic
-0.58
build
-0.58
POSITIVE LOGITS
increa
1.85
affor
1.80
maneu
1.75
SOO
1.60
accla
1.59
volunte
1.57
strick
1.57
resear
1.56
guarante
1.55
effe
1.55
Activations Density 0.206%