INDEX
    Explanations

    references to specific quantities or degrees

    "a" followed by quantity words

    a followed by quantities

    New Auto-Interp
    Negative Logits
     respective
    -0.61
     respectively
    -0.54
    //});
    -0.53
    ^(@)
    -0.51
     respectivement
    -0.48
    ともに
    -0.48
    harapkan
    -0.48
    ſelf
    -0.48
    共に
    -0.48
    libft
    -0.47
    POSITIVE LOGITS
     nice
    1.00
    MLLoader
    0.98
     bit
    0.97
     lot
    0.94
     really
    0.92
     weird
    0.87
     few
    0.85
     little
    0.84
    really
    0.83
    WithIOException
    0.83
    Act Density 0.289%

    No Known Activations