INDEX
    Explanations

    instances of "by" followed by the word "the" or another numeral phrase

    New Auto-Interp
    Negative Logits
    تفصیلات
    -0.46
    -0.44
    beelden
    -0.44
     helst
    -0.41
     adelante
    -0.40
    retudo
    -0.40
    gelöst
    -0.40
    horabuena
    -0.40
    SuppressLint
    -0.40
     perbaikan
    -0.40
    POSITIVE LOGITS
     virtue
    0.89
     means
    0.79
    standers
    0.76
     dint
    0.72
    stander
    0.71
    products
    0.68
    stolic
    0.64
     default
    0.63
     analogy
    0.62
     way
    0.60
    Act Density 0.269%

    No Known Activations