INDEX
Explanations
This neuron appears to activate on a variety of structured code-like text, possibly related to specific characters or structured formatting within code or data.
non-English words or code-related terms
code/foreign languages
New Auto-Interp
Negative Logits
EDEFAULT
-0.77
صوتيه
-0.76
tanleria
-0.74
InitVars
-0.73
Obrázky
-0.73
WriteBarrier
-0.71
виправивши
-0.68
TypedDataSet
-0.68
bootstrapcdn
-0.67
niggas
-0.66
POSITIVE LOGITS
adí
0.47
jLabel
0.46
док
0.42
للاسماء
0.42
+#+
0.42
enumi
0.40
وض
0.40
сок
0.40
findpost
0.40
unek
0.39
Activations Density 1.214%