Explanations
This neuron detects people's names (proper nouns), particularly names that mark a speaker or interviewee.
Negative Logits
Token               Logit
光                  -0.07
上                  -0.07
제출                -0.06
spanish             -0.06
itled               -0.06
_OUT                -0.06
Sign                -0.06
OCI                 -0.06
*T                  -0.06
AssemblyCopyright   -0.06
Positive Logits
Token               Logit
Projects            0.07
payments            0.07
HttpMethod          0.06
.Broadcast          0.06
jub                 0.06
anan                0.06
déjà                0.06
convey              0.06
initialState        0.06
till                0.06
Activations Density: 0.017%