INDEX
Explanations
references to events or gatherings happening in a specific location
Neuron Alignment
Index   Value   % of L₁
390     +0.12   0.6%
313     +0.11   0.5%
1527    +0.10   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
437     +0.12      0.03
1590    +0.11      0.03
405     +0.10      0.02
Negative Logits
Token       Logit
<bos>       -1.82
Vegeu       -0.91
ždý         -0.62
Externé     -0.61
Conteúdo    -0.60
Vanjske     -0.60
/***        -0.59
בְּ          -0.58
película    -0.58
Carreira    -0.58
Positive Logits
Token          Logit
Reception      1.33
reception      1.32
reception      1.25
receptions     1.22
Reception      1.17
RECE           1.16
receptionist   1.04
tucson         1.04
jaya           0.99
receivers      0.99
Activations Density: 0.235%