INDEX
    Explanations

    statistical metrics, particularly the term "mean" and its variations in context

    New Auto-Interp
    Negative Logits
     ddelweddau
    -0.69
     arşivlendi
    -0.67
     noqa
    -0.66
    stě
    -0.63
    .."
    -0.61
    ..."
    -0.61
    才會
    -0.60
    icoot
    -0.60
     Joni
    -0.60
    yti
    -0.59
    POSITIVE LOGITS
    mean
    2.55
     Mean
    2.46
     mean
    2.40
    Mean
    2.29
    MEAN
    2.18
     MEAN
    2.13
    平均
    1.06
     bedeuten
    0.87
    average
    0.86
     average
    0.86
    Act Density 0.115%

    No Known Activations