    Explanations

    ownership and possessive language

    Negative Logits
    Token        Logit
    "runner"     -0.18
    "лав"        -0.16
    "ÏĢά"        -0.15
    "Runner"     -0.15
    " Runner"    -0.14
    " Rede"      -0.14
    "CLR"        -0.14
    "roads"      -0.14
    "Router"     -0.14
    "xor"        -0.14
    Positive Logits
    Token        Logit
    " right"     0.91
    "right"      0.73
    " Right"     0.69
    "-right"     0.69
    "_right"     0.67
    "Right"      0.66
    " RIGHT"     0.66
    ".right"     0.64
    "\tright"    0.60
    "åı³"        0.55
    Activation Density: 0.235%

    No Known Activations