INDEX
Explanations
mentions of particular events or performances in various locations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.17
0.6%
752
+0.08
0.3%
683
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
678
+0.17
0.06
802
+0.08
0.05
820
+0.08
0.05
Negative Logits
<bos>
-1.52
ⓧ
-1.34
-1.21
/**
-1.06
<?
-1.02
/*
-0.91
}{||-0.76
/***
-0.75
EXPERIMENTS
-0.71
hindurch
-0.67
POSITIVE LOGITS
kram
1.29
silikon
1.29
Kategor
1.29
plak
1.27
alkoh
1.25
kosme
1.25
maksi
1.25
seksi
1.20
keramik
1.19
panik
1.18
Activations Density 0.332%