INDEX
Explanations
military service
This neuron detects references to LGBTQ identities (gay, lesbian, bisexual, transgender) in the context of military service.
New Auto-Interp
Negative Logits
nuevo
-0.08
즈
-0.07
X
-0.07
would
-0.07
burge
-0.06
asmus
-0.06
ASF
-0.06
Would
-0.06
Stand
-0.06
.te
-0.06
POSITIVE LOGITS
Differences
0.07
[],↵
0.06
тільки
0.06
[]);↵
0.06
drama
0.06
<<"
0.06
(handles
0.06
reckon
0.06
Figures
0.06
gifts
0.06
Activations Density 0.003%