INDEX
Explanations
sentences expressing shock, sadness, and sympathy towards victims of a tragic event
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
468
+0.17
0.5%
604
+0.10
0.3%
998
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
468
+0.17
0.08
838
+0.10
0.05
946
+0.10
0.06
Negative Logits
affez
-1.07
swarovski
-1.03
!...
-1.01
ecru
-1.01
broderie
-0.99
suscep
-0.98
gmbh
-0.97
?...
-0.95
fluo
-0.95
bordeaux
-0.93
POSITIVE LOGITS
incident
1.00
incidents
0.83
tragedy
0.76
occurrence
0.75
events
0.70
situation
0.69
happened
0.68
event
0.68
incident
0.68
tragic
0.65
Activations Density 0.522%