INDEX
Explanations
The neuron is tuned to spot the “Pro” token in “Pro Se” appellations indicating a self-represented party.
New Auto-Interp
Negative Logits
httpClient
-0.07
addictive
-0.07
h
-0.06
mort
-0.06
terk
-0.06
.Cancel
-0.06
âu
-0.06
.........
-0.06
icks
-0.06
CARD
-0.06
POSITIVE LOGITS
lista
0.07
disbelief
0.06
plataforma
0.06
�
0.06
гра
0.06
â
0.06
Ú
0.06
nutí
0.06
ırak
0.06
NonNull
0.06
Activations Density 0.000%