INDEX
    Explanations

    references to structured data or categories related to software and systems

    This neuron appears to be detecting spam or low-quality content, particularly product advertisements and incoherent text passages.

    New Auto-Interp
    Negative Logits
     linkovi
    -0.38
     either
    -0.36
     ivelany
    -0.36
     usually
    -0.35
    either
    -0.35
     lo
    -0.34
    hos
    -0.34
     preceding
    -0.33
     /
    -0.33
     ppl
    -0.33
    POSITIVE LOGITS
    :✨
    1.61
    Portail
    0.63
    قایناق‌لار
    0.60
     للاسماء
    0.60
     Verſ
    0.56
    erintah
    0.54
     AttributeSet
    0.54
    AsUp
    0.51
    rbrakk
    0.50
    Datuak
    0.49
    Act Density 0.013%

    No Known Activations