Explanations
Image captions
The neuron activates strongly on named-entity tokens—that is, proper names of people, places, and organizations.
Negative Logits
released        -0.07
Those           -0.07
ceptors         -0.07
Pandora         -0.07
cũng            -0.07
Astr            -0.07
@test           -0.06
longitude       -0.06
Digital         -0.06
Transportation  -0.06
Positive Logits
coroutine       0.06
侧              0.06
_IF             0.06
repository      0.06
categorie       0.06
projecting      0.06
ıb              0.06
.jwt            0.06
pollutants      0.06
repairs         0.06
Activations Density: 0.010%