INDEX
Explanations
mentions of fantasy as a genre or theme in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.29
1.7%
71
+0.13
0.8%
377
+0.13
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
353
+0.29
0.02
231
+0.13
0.02
74
+0.13
0.01
Negative Logits
ticket
-1.44
same
-1.41
ori
-1.37
trusted
-1.36
success
-1.35
donate
-1.34
еÑģ
-1.34
ERT
-1.33
ott
-1.32
ritic
-1.32
POSITIVE LOGITS
istically
2.10
wise
1.83
ontally
1.70
ignment
1.64
"}](#
1.60
ually
1.57
keeping
1.55
itious
1.55
illac
1.51
kill
1.44
Activations Density 0.015%