    Negative Logits
    Token           Logit
     Getting        -0.07
    нице            -0.07
     countless      -0.06
     extends        -0.06
     Frank          -0.06
    Opened          -0.06
     Attached       -0.06
    inks            -0.06
     bakery         -0.06
    .sw             -0.06

    Positive Logits
    Token           Logit
     poplat          0.07
     decid           0.06
    ’ai              0.06
     glBind          0.06
                     0.06
    _mov             0.06
     @"↵             0.06
     spolu           0.06
     sunt            0.06
     blossom         0.06
    Activation Density: 0.210%

    No Known Activations