INDEX
Explanations
phrases related to angles or directions
Neuron Alignment
| Index | Value | % of L₁ |
| --- | --- | --- |
| 1376 | +0.17 | 0.8% |
| 1335 | +0.15 | 0.8% |
| 1705 | +0.14 | 0.7% |
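The "% of L₁" column is not defined on the page. One plausible reading, stated here purely as an assumption, is that it reports each neuron's absolute value as a share of the L₁ norm of the whole vector. A minimal sketch under that assumption, using a hypothetical helper `l1_share` and toy numbers rather than the dashboard's data:

```python
def l1_share(values, index):
    """Return |values[index]| as a fraction of the L1 norm of `values`.

    Assumption: this is what a "% of L1" column would measure; the
    source page does not define the column.
    """
    l1_norm = sum(abs(v) for v in values)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; share is undefined")
    return abs(values[index]) / l1_norm


# Toy vector (not taken from the dashboard).
toy = [0.5, -0.25, 0.25]
share_of_first = l1_share(toy, 0)  # 0.5 / (0.5 + 0.25 + 0.25)
```

Multiplying the result by 100 gives a percentage in the same style as the table.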
Correlated Neurons
| Index | P. Corr. | Cos Sim. |
| --- | --- | --- |
| 1376 | +0.17 | 0.03 |
| 251 | +0.15 | 0.02 |
| 1385 | +0.14 | 0.03 |
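The two column names suggest a correlation and a cosine similarity. The page does not say which vectors are compared, so the sketch below only illustrates the two metrics on toy data, assuming "P. Corr." stands for Pearson correlation. The function names are illustrative and not part of any dashboard API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation, computed as the cosine similarity of the
    mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Toy vectors (not taken from the dashboard).
u = [1.0, 2.0, 3.0]
v = [2.0, 4.0, 6.0]
corr_uv = pearson_correlation(u, v)
cos_uv = cosine_similarity(u, v)
```

Because Pearson correlation subtracts the mean first, the two metrics can disagree when the inputs have a large constant offset, which is one reason a dashboard might report both.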
Negative Logits
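| Token | Logit |
| --- | --- |
| `<bos>` | -1.56 |
| `ⓧ` | -0.96 |
| `/**` | -0.79 |
| `/*` | -0.71 |
| `<?` | -0.62 |
| `ren` | -0.59 |
|  | -0.59 |
| `put` | -0.59 |
| `Élet` | -0.58 |
| `Kar` | -0.57 |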
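Positive Logits

| Token | Logit |
| --- | --- |
| `angle` | 1.46 |
| `Angle` | 1.39 |
| `angles` | 1.38 |
| `Angle` | 1.22 |
| `Minang` | 1.22 |
| `angle` | 1.19 |
| `ANGLE` | 1.16 |
| `meis` | 1.14 |
| `Angles` | 1.13 |
| `kram` | 1.12 |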
Activations Density: 0.313%
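The page does not define activation density. A common reading, assumed here, is the fraction of sampled tokens on which the feature has a nonzero activation. The helper `activation_density` and its `threshold` parameter are hypothetical, and the data is synthetic.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of entries whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the
    feature fires; the source page does not spell this out.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic example: 3 active tokens out of 1000.
sample = [0.0] * 997 + [0.4, 1.2, 0.7]
density = activation_density(sample)
density_percent = 100 * density
```

Under this reading, a density of 0.313% would mean the feature fires on roughly 3 of every 1000 sampled tokens.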