    Explanations

    references to average values in data

    Negative Logits
    Token                  Logit
    "ulemon"               -0.65
    " DialogInterface"     -0.62
    "sp"                   -0.60
    " Murdoch"             -0.56
    " ed"                  -0.56
    "hasMoreElements"      -0.55
    " sk"                  -0.55
    " emb"                 -0.54
    " Kirk"                -0.54
    "führt"                -0.54

    Positive Logits
    Token                  Logit
    " average"             3.23
    "average"              3.04
    " Average"             3.03
    "Average"              2.96
    " AVERAGE"             2.85
    " averages"            2.64
    "AVERAGE"              2.62
    " averaged"            2.55
    " avg"                 2.46
    " averaging"           2.46

    Activation Density: 0.078%

    No Known Activations