INDEX
Explanations
references to organizations and specific events related to research and public policy
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.19
1.0%
1577
+0.17
0.9%
1343
+0.16
0.9%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.19
0.25
227
+0.17
0.23
453
+0.16
0.18
Negative Logits
<bos>
-3.53
ⓧ
-0.87
@[+][
-0.85
onStop
-0.84
IsMutable
-0.83
betweenstory
-0.78
onPause
-0.77
Fordítás
-0.76
ItemBackground
-0.76
springfox
-0.74
POSITIVE LOGITS
gettyimages
1.23
pleins
1.20
uefa
1.18
habile
1.17
éto
1.17
maroc
1.17
confé
1.13
milano
1.11
getty
1.10
sergio
1.08
Activations Density 4.096%