INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Skocz
    -0.70
     Мексичка
    -0.68
    GEBURTSDATUM
    -0.65
     समीक्षाओं
    -0.59
     ویکی‌پدی
    -0.58
     بيها
    -0.56
     @}
    -0.55
    idigung
    -0.55
    SourceChecksum
    -0.54
    évaluateur
    -0.54
    POSITIVE LOGITS
     caught
    1.42
     catch
    1.42
     Catch
    1.37
     catches
    1.34
    caught
    1.34
     catching
    1.31
    catch
    1.20
     CATCH
    1.19
    Catch
    1.18
    Caught
    1.16
    Act Density 0.136%

    No Known Activations