INDEX
Explanations
specific amusement park attractions related to a famous movie series
Neuron Alignment
Index    Value    % of L₁
50       +0.20    0.7%
559      +0.09    0.3%
971      +0.07    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
512      +0.20       0.04
1860     +0.09       0.04
138      +0.07       0.04
Negative Logits
<bos>             -2.63
ⓧ                 -0.96
/**               -0.75
(blank)           -0.74
<?                -0.74
SequentialGroup   -0.74
initComponents    -0.74
HasIndex          -0.73
зулта             -0.72
jsPsych           -0.71
Positive Logits
affor             2.02
maneu             1.96
increa            1.88
volunte           1.83
fortn             1.81
wien              1.81
impra             1.79
fta               1.78
stockholm         1.77
strick            1.77
Activations Density 0.209%