INDEX
    Explanations

    the use of the word "On" in different contexts

    New Auto-Interp
    Negative Logits
    ahlen
    -0.16
    azon
    -0.16
    eum
    -0.15
     McKay
    -0.15
    ulously
    -0.15
    jack
    -0.15
    à¸Ńà¸ĩà¸Ħ
    -0.14
    leaflet
    -0.14
    ól
    -0.14
    ntl
    -0.14
    POSITIVE LOGITS
    nen
    0.22
    /off
    0.21
     behalf
    0.21
    yx
    0.20
    ishi
    0.19
    eness
    0.17
    nn
    0.16
    slow
    0.16
    shore
    0.16
    kud
    0.16
    Act Density 0.061%

    No Known Activations