INDEX
    Explanations

    prepositions indicating a relationship or connection between entities

    prepositions and certain conjunctions indicating relationships or positions

    Negative Logits
     WATCHED        -0.71
    Written         -0.69
    idav            -0.68
    >>>>>>>>        -0.68
     Edited         -0.64
    ascript         -0.63
    nikov           -0.63
    pmwiki          -0.62
     STATES         -0.62
     Publications   -0.62

    Positive Logits
    hem             0.69
    irlf            0.65
    days            0.63
    selves          0.62
    ilk             0.61
    GF              0.61
    ngth            0.60
    hest            0.59
     stride         0.58
     predecessors   0.58

    Activation Density: 0.357%

    No Known Activations