INDEX
Explanations
references to a specific location or venue, particularly related to events or performances
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
59
+0.15
0.8%
410
+0.14
0.8%
328
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
332
+0.15
0.01
59
+0.14
0.01
7
+0.12
0.01
Negative Logits
Ń
-4.12
ľĵ
-4.05
Ĥ
-4.04
·
-3.90
↵
-3.85
<|outofrange|>
-3.85
↵
-3.85
↵
-3.85
-3.85
-3.85
POSITIVE LOGITS
ilion
2.61
illion
2.25
lei
2.03
illon
1.85
ilage
1.85
uet
1.71
itably
1.71
elled
1.67
ulous
1.67
opan
1.66
Activations Density 0.014%