INDEX
Explanations
phrases related to toys and toy sales
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.17
1.0%
1416
+0.12
0.7%
406
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1416
+0.17
0.03
406
+0.12
0.02
1778
+0.11
0.03
Negative Logits
<bos>
-2.90
ⓧ
-1.04
-0.97
intersper
-0.94
/**
-0.93
<?
-0.93
disbur
-0.86
defray
-0.83
/***
-0.82
/*!
-0.70
POSITIVE LOGITS
toys
1.30
toy
1.28
Toy
1.24
Toy
1.21
Toys
1.12
toy
1.09
TOY
1.07
Toys
1.06
toys
0.99
riva
0.98
Activations Density 0.121%