INDEX
    Explanations

    comparative statements beginning with "More than"

    comparative phrases indicating quantities or measures

    New Auto-Interp
    Negative Logits
    Interstitial
    -0.82
    EMENT
    -0.73
    ãĤ¼ãĤ¦ãĤ¹
    -0.73
    oris
    -0.70
    ALE
    -0.69
    ModLoader
    -0.68
    microsoft
    -0.68
    ALSE
    -0.67
    Ô
    -0.67
    ç·
    -0.66
    POSITIVE LOGITS
    whelming
    0.83
    ground
    0.67
    atos
    0.66
    etheless
    0.65
     500
    0.65
     200
    0.64
     400
    0.64
     8000
    0.63
     half
    0.63
     1500
    0.62
    Act Density 0.049%

    No Known Activations