INDEX
Explanations
information related to events, people, and actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
184
+0.13
0.4%
468
+0.10
0.3%
776
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1023
+0.13
0.05
161
+0.10
0.04
1244
+0.10
0.04
Negative Logits
abbra
-1.01
espé
-0.92
affez
-0.89
sguardo
-0.88
autunno
-0.86
canel
-0.85
specchio
-0.84
auguri
-0.82
morire
-0.82
<bos>
-0.80
POSITIVE LOGITS
latter
0.65
happened
0.62
coincided
0.62
incident
0.62
event
0.61
corresponded
0.60
resulted
0.59
episode
0.59
prompted
0.57
circumstance
0.56
Activations Density 0.204%