INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Focus
    -0.06
    (style
    -0.06
    бора
    -0.06
     switched
    -0.06
     Jonas
    -0.06
    .isdigit
    -0.06
     echoed
    -0.06
    rush
    -0.06
     oss
    -0.05
     domicile
    -0.05
    POSITIVE LOGITS
     jejich
    0.07
    0.07
    ponder
    0.07
    0.06
    screens
    0.06
    $wp
    0.06
    $conn
    0.06
     köy
    0.06
    .want
    0.06
    PERT
    0.06
    Act Density 0.036%

    No Known Activations