INDEX
Explanations
The main thing this neuron does is find mentions of top entities or superlatives in the context of the world or global scope
occurrences of the letter "s" in various contexts
New Auto-Interp
Negative Logits
Lauder
-0.72
Å¡
-0.68
¿½
-0.66
docker
-0.64
Slate
-0.62
Sale
-0.62
iov
-0.60
yz
-0.60
HOU
-0.59
bush
-0.59
POSITIVE LOGITS
selves
1.06
tallest
0.84
premiere
0.84
foremost
0.81
Greatest
0.78
greatest
0.77
ifted
0.77
oldest
0.75
pecially
0.75
ankind
0.74
Activations Density 0.101%