    Explanations

    words indicating intensity or frequency of actions or states

    Negative Logits
    Token            Logit
    "]').            -0.77
    ="{{             -0.76
    UnusedPrivate    -0.75
    ']").            -0.74
     ")              -0.70
    '=>$             -0.70
    ']);             -0.70
    "){              -0.69
     ?>/             -0.68
    }`).             -0.68
    Positive Logits
    Token            Logit
     >=",            0.67
    Demografie       0.64
     zaidi           0.59
     enough          0.58
    tawesome         0.56
    pení             0.55
     خارجية          0.53
    gawai            0.53
     again           0.53
    eclampsia        0.52
    Activation Density: 0.497%

    No Known Activations