INDEX
Explanations
characters and symbols from a non-Latin script, likely related to a specific language
This neuron is looking for text in a language that uses the Cyrillic alphabet
New Auto-Interp
Negative Logits
assador
-0.78
essa
-0.78
emonium
-0.75
combe
-0.73
Starr
-0.69
ierrez
-0.69
iqueness
-0.69
worldly
-0.67
aido
-0.67
ernels
-0.67
POSITIVE LOGITS
к
1.61
ÑĤ
1.57
м
1.53
е
1.53
Ñ
1.51
Ñı
1.50
д
1.47
ÑĢ
1.47
Ð
1.44
л
1.39
Activations Density 0.012%