INDEX
Explanations
the word "rather" followed by either a verb or a negation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
161
+0.13
0.4%
50
+0.13
0.4%
1065
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1839
+0.13
0.03
426
+0.13
0.02
989
+0.11
0.02
Negative Logits
echo
-0.58
Excluir
-0.58
isset
-0.56
AfterViewInit
-0.56
DecimalFormat
-0.55
extend
-0.54
foreach
-0.54
美
-0.53
catch
-0.53
tick
-0.52
POSITIVE LOGITS
swarovski
1.52
depic
1.34
eiffel
1.30
stockholm
1.29
eyel
1.28
flyknit
1.28
lola
1.25
ecru
1.24
hairc
1.23
tiffany
1.23
Activations Density 0.195%