INDEX
Explanations
No Explanations Found
Neuron Alignment
Index   Value   % of L₁
197     +0.13   0.7%
290     +0.13   0.7%
14      +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
438     +0.13      0.40
53      +0.13      0.42
56      +0.12      0.11
Negative Logits
ricks     -1.70
óg        -1.67
ching     -1.57
\[[@      -1.37
asti      -1.37
ter       -1.29
yne       -1.27
uffer     -1.25
resolve   -1.25
ERC       -1.25
Positive Logits
Caption     1.65
iably       1.62
himself     1.61
caption     1.50
footage     1.48
biography   1.45
laughter    1.45
caption     1.43
ħ           1.40
oneself     1.40
Activations Density 4.670%
No Known Activations
This feature has no known activations.