Explanations
information about various places, events, and activities
Neuron Alignment
Index   Value   % of L₁
382     +0.21   1.1%
50      +0.20   1.1%
1535    +0.13   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
382     +0.21      0.10
1352    +0.20      0.07
1445    +0.13      0.07
Negative Logits
<bos>       -3.01
intersper   -1.59
endow       -1.26
quitted     -1.26
forbear     -1.22
ⓧ           -1.21
vainly      -1.19
disambigu   -1.17
reconno     -1.16
unve        -1.14
Positive Logits
marea       0.86
miniatura   0.85
seksi       0.82
Portugu     0.72
cosmé       0.72
kokos       0.72
bobina      0.72
gelatina    0.72
silikon     0.72
manuten     0.71
Activations Density 0.213%