INDEX
Explanations
references to structured data or categories related to software and systems
This neuron appears to be detecting spam or low-quality content, particularly product advertisements and incoherent text passages.
New Auto-Interp
Negative Logits
linkovi
-0.38
either
-0.36
ivelany
-0.36
usually
-0.35
either
-0.35
lo
-0.34
hos
-0.34
preceding
-0.33
/
-0.33
ppl
-0.33
POSITIVE LOGITS
:✨
1.61
Portail
0.63
قایناقلار
0.60
للاسماء
0.60
Verſ
0.56
erintah
0.54
AttributeSet
0.54
AsUp
0.51
rbrakk
0.50
Datuak
0.49
Activations Density 0.013%