INDEX
Explanations
requests for attribution or credit in a written work or creation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1150
+0.12
0.4%
674
+0.09
0.3%
1343
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1641
+0.12
0.03
185
+0.09
0.02
1150
+0.09
0.01
Negative Logits
Spoljašnje
-0.65
Glej
-0.58
CompleteListener
-0.57
Iné
-0.54
ModelExpression
-0.53
IsRequired
-0.53
<>());
-0.53
Poznám
-0.52
HasAnnotation
-0.52
Pozri
-0.52
POSITIVE LOGITS
hentai
1.04
scrat
0.97
inext
0.95
indestru
0.88
affor
0.88
casio
0.86
milf
0.86
strick
0.86
uniqu
0.86
perfet
0.85
Activations Density 0.201%