INDEX
Explanations
mentions of a specific animated TV show
Neuron Alignment
Index   Value   % of L₁
1757    +0.18   0.8%
486     +0.15   0.7%
25      +0.15   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
981     +0.18      0.04
227     +0.15      0.04
1757    +0.15      0.03
Negative Logits
مرئيه         -0.49
McNally       -0.46
Burroughs     -0.46
película      -0.46
庐            -0.46
Felton        -0.45
Kaye          -0.44
NgModule      -0.44
audiovisuel   -0.44
ny            -0.43
Positive Logits
Simpsons      1.18
Simpson       1.11
simpsons      1.05
Bart          1.04
Homer         1.00
Simpson       0.98
Bart          0.95
Homer         0.95
simpson       0.95
Springfield   0.89
Activations Density 0.231%