INDEX
Explanations
No Explanations Found
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
56
+0.13
0.7%
249
+0.13
0.7%
350
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
0
-0.13
0.00
1
-0.13
0.00
2
-0.11
0.00
Negative Logits
ships
-1.73
burgh
-1.52
street
-1.38
Broadway
-1.38
grievances
-1.38
wich
-1.34
INGS
-1.34
history
-1.34
expans
-1.32
baum
-1.29
POSITIVE LOGITS
mology
1.64
]{}]{}1.62
ati
1.60
nat
1.49
ini
1.47
]{}\1.46
]{}1.43
]{}[1.42
ain
1.41
olin
1.40
Activations Density 0.000%
No Known Activations
This feature has no known activations.