INDEX
    Explanations

    exceptions to a general rule or trend

    phrases indicating exceptions or exclusions

    New Auto-Interp
    Negative Logits
     neighb
    -0.71
     rall
    -0.67
     cart
    -0.65
     ancest
    -0.63
    oded
    -0.61
    eny
    -0.61
     ger
    -0.61
     precinct
    -0.59
     parcel
    -0.58
    eng
    -0.58
    POSITIVE LOGITS
     exceptions
    0.78
    dylib
    0.76
    DragonMagazine
    0.70
    blance
    0.70
     Legendary
    0.69
    rities
    0.69
    ptions
    0.66
    ĨĴ
    0.66
    ngth
    0.65
    perty
    0.65
    Act Density 0.015%

    No Known Activations