INDEX
    Explanations

    sentences that express strong opinions or emotional responses

    Negative Logits
     queſta
    -0.64
     lcm
    -0.59
    جغرافيا
    -0.59
    findpost
    -0.59
    MethodManager
    -0.59
    TestingModule
    -0.58
    -0.58
    Cyfeiriadau
    -0.57
    batore
    -0.56
    出版年
    -0.56
    Positive Logits
     básica
    0.30
    impianto
    0.29
     bapak
    0.29
     morire
    0.29
    pañas
    0.28
     hermanos
    0.28
     stolz
    0.27
     powinna
    0.26
     folosit
    0.26
     dumne
    0.26
    Activation Density: 0.038%

    No Known Activations