INDEX
    Explanations

    expressions of personal experience and feelings

    Negative Logits
    Token           Logit
     kasarigan      -1.07
     noDo           -0.82
    Personensuche   -0.82
     betweenstory   -0.82
    InitVars        -0.80
    دانشنامهٔ        -0.79
    EDEFAULT        -0.78
     kaarangay      -0.76
     Efq            -0.73
                    -0.73

    Positive Logits
    Token           Logit
    '               1.34
                    1.32
    ve              0.89
     ve             0.79
    `               0.75
    â               0.69
     have           0.68
    &#              0.64
    \'              0.61
    v               0.60
    Activation Density: 0.211%

    No Known Activations