INDEX
    Explanations

    This neuron activates on HTML character entity references (sequences like “&name;”).

    New Auto-Interp
    Negative Logits
    plit
    -0.07
     Valve
    -0.07
     rij
    -0.07
     uk
    -0.06
    -0.06
    обав
    -0.06
    350
    -0.06
    /temp
    -0.06
     Ваш
    -0.06
    +len
    -0.06
    POSITIVE LOGITS
    พยาบาล
    0.07
     ziyaret
    0.06
    0.06
     PERMISSION
    0.06
    การเล
    0.06
    _annotations
    0.06
    用户名
    0.06
     salute
    0.06
     ş
    0.06
    =device
    0.06
    Act Density 0.006%

    No Known Activations