INDEX
Explanations
mentions of a specific location or venue
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
964
+0.10
0.3%
906
+0.09
0.2%
1551
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.10
0.04
1415
+0.09
0.03
73
+0.08
0.03
Negative Logits
hcm
-0.88
umo
-0.84
Quod
-0.80
unlaw
-0.80
kasa
-0.80
isolato
-0.77
termica
-0.76
tulum
-0.75
fep
-0.74
politika
-0.74
POSITIVE LOGITS
<bos>
0.75
familiar
0.74
know
0.70
knows
0.66
probably
0.66
familiar
0.61
remember
0.59
probably
0.58
know
0.56
familiarity
0.56
Activations Density 0.253%