INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     zug
    -0.07
     Alonso
    -0.06
    books
    -0.06
     полот
    -0.06
    THON
    -0.06
     Tuple
    -0.06
    _Register
    -0.06
     Statements
    -0.06
     Daemon
    -0.06
     flavours
    -0.06
    POSITIVE LOGITS
     near
    0.15
     Near
    0.09
     nearby
    0.08
    0.08
    near
    0.07
    ────
    0.07
    Near
    0.07
    0.07
    Ge
    0.07
    		
    ↵		
    ↵
    0.07
    Act Density 0.017%

    No Known Activations