INDEX
Explanations
This neuron appears to be looking for a specific pattern of characters or words that don't conform to any recognizable language or structure
specific characters or symbols, potentially from various languages or encoding systems
New Auto-Interp
Negative Logits
bonded
-0.78
cius
-0.76
flour
-0.75
biod
-0.73
bour
-0.73
dominated
-0.73
rigged
-0.73
centrally
-0.71
glossy
-0.71
polyg
-0.70
POSITIVE LOGITS
ãģŁ
1.86
ãģ¦
1.82
ãģĦ
1.81
ãģ¾
1.81
ãĤĭ
1.80
ãĤ
1.78
ãģ
1.75
ãģª
1.75
ãĢģ
1.72
ãģ§
1.71
Activations Density 0.019%