INDEX
Explanations
The neuron detects tokens that indicate a person’s marital status, especially “married” (and its equivalents in other languages).
New Auto-Interp
Negative Logits
opening
-0.08
_WORD
-0.07
doses
-0.07
Small
-0.07
kitchen
-0.07
safety
-0.06
Proposed
-0.06
_ID
-0.06
mouth
-0.06
ार
-0.06
POSITIVE LOGITS
아
0.06
ansible
0.06
ąż
0.06
、い
0.06
Turning
0.06
thay
0.06
得
0.06
Kingston
0.06
.navCtrl
0.06
<fieldset
0.06
Activations Density 0.009%