INDEX
Explanations
information related to upcoming events, sales, healthcare process improvement measures, vitamin information, and hate crimes statistics
Neuron Alignment
Index   Value   % of L₁
394     +0.15   0.4%
50      +0.12   0.3%
227     +0.11   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
227     +0.15      0.09
919     +0.12      0.04
394     +0.11      0.07
Negative Logits
Билгалдахарш      -0.86
脚注の使い方      -0.81
<bos>             -0.79
estekak           -0.72
Personendaten     -0.71
ValueGeneration   -0.69
biograf           -0.69
höl               -0.67
uxxxx             -0.66
utop              -0.65
Positive Logits
unspeak     1.32
apprehen    1.21
vainly      1.18
ineffec     1.15
disagre     1.10
unwarran    1.09
indescri    1.07
gaily       1.04
tolerably   1.02
impelled    1.01
Activations Density 0.688%