INDEX
Explanations
references to subway systems and related topics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1870
+0.11
0.3%
406
+0.10
0.3%
1385
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1601
+0.11
0.03
1705
+0.10
0.03
568
+0.10
0.03
Negative Logits
lara
-0.94
ciao
-0.90
nicolas
-0.90
?...
-0.90
embra
-0.89
blos
-0.89
encomp
-0.87
inev
-0.87
!...
-0.86
accla
-0.86
POSITIVE LOGITS
transit
0.92
subway
0.89
transit
0.76
Transit
0.71
rail
0.70
train
0.69
Transit
0.67
trains
0.63
transportation
0.61
Metro
0.60
Activations Density 0.116%