INDEX
    Explanations

    phrases related to authorship or attribution in documents

    instances of the word "for" in various contexts

    Negative Logits
    Token              Logit
    "soever"           -0.75
    " [+"              -0.67
    " quo"             -0.65
    " alive"           -0.64
    "hazard"           -0.64
    "ynski"            -0.63
    " nevertheless"    -0.63
    "ÃŁ"               -0.62
    "egu"              -0.61
    "ptions"           -0.61

    Positive Logits
    Token              Logit
    "geries"           1.23
    "bidden"           0.91
    "gery"             0.87
    " inclusion"       0.85
    "imeo"             0.81
    " Collider"        0.73
    " listeners"       0.73
    " sale"            0.70
    " viewers"         0.69
    "ummies"           0.69

    Activation Density: 0.208%

    No Known Activations