INDEX
Explanations
TV shows, segments, and other entertainment-related information
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.11
0.3%
783
+0.10
0.3%
2010
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2010
+0.11
0.06
1854
+0.10
0.04
783
+0.10
0.03
Negative Logits
fep
-1.17
tranf
-1.16
ftu
-1.15
unwarran
-1.07
fatis
-1.06
fto
-1.05
thut
-1.05
nece
-1.04
pollut
-1.01
perfon
-1.01
POSITIVE LOGITS
comedy
1.02
Comedy
0.93
comedian
0.89
comedians
0.88
comedy
0.85
comedic
0.84
humor
0.84
Comedy
0.79
hilarious
0.77
jokes
0.77
Activations Density 0.755%