INDEX
    Explanations

    occurrences of the word "reply" in different contexts

    mentions of replies to comments or posts

    New Auto-Interp
    Negative Logits
    stakes
    -0.75
    elson
    -0.75
    veh
    -0.69
    Ĥª
    -0.67
    licts
    -0.65
     impacted
    -0.65
     eroded
    -0.62
    arc
    -0.61
     illegally
    -0.61
    ewater
    -0.61
    POSITIVE LOGITS
     reply
    3.91
     replies
    2.71
    reply
    2.59
    Reply
    2.07
     response
    2.01
     answer
    1.97
     Reply
    1.95
     replied
    1.85
     respond
    1.69
    Answer
    1.68
    Act Density 0.012%

    No Known Activations