INDEX
    Explanations

    the presence of slashes in text

    Negative Logits
    -0.75  "featureID"
    -0.73  " betweenstory"
    -0.57  " kaarangay"
    -0.56  "WriteBarrier"
    -0.55  "GEBURTSDATUM"
    -0.52  "GIVEREF"
    -0.50  " Wikimedijinoj"
    -0.50  " ویکی‌پدی"
    -0.50  "DockStyle"
    -0.50  " aDecoder"
    Positive Logits
    0.43  "出版年"
    0.35  " تضيفلها"
    0.31  " énergé"
    0.31  " fatica"
    0.31  " (){"
    0.30  " forklift"
    0.30  " sauvages"
    0.30  "ม้"
    0.29  "intios"
    0.29  " lembran"
    Act Density 0.000%

    No Known Activations