    Explanations

    terms and phrases indicating large quantity or magnitude

    Negative Logits
    Token          Logit
    "isan"         -0.16
    " ?><?"        -0.15
    "ford"         -0.15
    "ÑĢеж"         -0.14
    "à¥įषण"        -0.14
    "/Image"       -0.14
    "imers"        -0.14
    "ving"         -0.14
    "ensis"        -0.14
    "sWith"        -0.14
    Positive Logits
    Token          Logit
    " amounts"     0.38
    " amount"      0.32
    "amount"       0.31
    " Amount"      0.28
    "-scale"       0.26
    " quantities"  0.23
    "Domains"      0.23
    "Amount"       0.22
    ".amount"      0.20
    " amt"         0.20
    Act Density 0.029%

    No Known Activations