Explanations
the word "Clear", which may have various contexts but shows a strong association with this neuron
mentions of the word "Clear" and related variations
Negative Logits
hob          -0.74
mim          -0.73
dwar         -0.73
imaginary    -0.70
labou        -0.70
homage       -0.66
resp         -0.66
obos         -0.66
repr         -0.65
pun          -0.65
Positive Logits
Clear        3.75
Clear        3.41
clear        2.36
clear        1.52
CLE          1.40
clearance    1.27
clears       1.25
Cle          1.25
Bright       1.18
Plain        1.13
Activations Density 0.017%