INDEX
Explanations
This neuron mainly detects events in which a specific location becomes the first, or one of the first, to do something.
Mentions of countries in relation to specific statuses or rankings.
Negative Logits
åħī          -0.64
worm         -0.64
mediated     -0.63
Peng         -0.62
dfx          -0.62
fram         -0.61
matter       -0.61
cv           -0.60
00000        -0.60
illusions    -0.60
Positive Logits
safest         0.97
destination    0.89
birthplace     0.87
swing          0.87
destinations   0.83
havens         0.83
battleground   0.81
capitals       0.78
jurisdictions  0.74
orest          0.74
Activations Density: 0.186%