INDEX
Explanations
references to legal agreements and compliance with labor laws
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1499
+0.12
0.4%
1553
+0.09
0.3%
1585
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1499
+0.12
0.09
1585
+0.09
0.04
1776
+0.09
0.03
Negative Logits
pixar
-1.36
indescri
-1.29
hairc
-1.26
jurassic
-1.24
impra
-1.24
milf
-1.23
inconce
-1.22
hentai
-1.21
snoopy
-1.21
Mlle
-1.21
POSITIVE LOGITS
enforcement
0.87
enforcement
0.75
Enforcement
0.71
complaint
0.63
investigation
0.63
investigations
0.62
الدراسه
0.61
inspections
0.60
mergeFrom
0.59
enforce
0.59
Activations Density 0.614%