INDEX
Explanations
references to theatrical venues
Neuron Alignment
Index    Value    % of L₁
1133     +0.12    0.5%
1778     +0.12    0.5%
1392     +0.12    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1778     +0.12       0.02
1133     +0.12       0.02
1392     +0.12       0.02
Negative Logits
accla      -0.60
Sén        -0.57
Docteur    -0.53
carbone    -0.51
bourg      -0.51
toul       -0.50
Classe     -0.49
Carga      -0.48
vinyle     -0.48
onymus     -0.48
Positive Logits
theater     1.25
theatre     1.21
Theater     1.12
Theatre     1.11
theaters    1.10
theatre     1.08
theater     1.04
Theatre     1.03
theatres    1.02
Theater     1.02
Activations Density: 0.086%