INDEX
    Explanations

    text related to posting content online and engaging in online discussions

    references to posts, requests, or inquiries related to various topics, especially in the context of sharing information, events, and services

    Negative Logits
    Token               Logit
    " looph"            -0.65
    " wedd"             -0.61
    "utsche"            -0.59
    "ofi"               -0.58
    "ueless"            -0.57
    "AME"               -0.56
    "idable"            -0.55
    " repud"            -0.55
    "atories"           -0.53
    "DragonMagazine"    -0.53
    Positive Logits
    Token               Logit
    " or"               1.03
    ".?"                1.00
    " please"           0.99
    " lately"           0.97
    "please"            0.95
    " :("               0.94
    " PLEASE"           0.90
    " Please"           0.87
    " Recently"         0.87
    "/?"                0.84
    Activation Density: 0.593%

    No Known Activations