INDEX
    Explanations

    names of researchers, editors, and authors in academic contexts

    plural nouns or nouns with similar endings

    Negative Logits
    " exception"     -0.62
    " exceptions"    -0.59
    " Ocean"         -0.58
    " fraction"      -0.58
    " recharge"      -0.56
    " exemptions"    -0.56
    " comparable"    -0.55
    "SIGN"           -0.55
    "ASED"           -0.55
    " arts"          -0.55
    Positive Logits
    "ki"        1.83
    "kaya"      1.75
    "ky"        1.72
    "mith"      1.69
    "hire"      1.57
    "nyder"     1.47
    "hip"       1.45
    "iewicz"    1.36
    "haw"       1.36
    "cu"        1.35
    Activation Density: 0.181%

    No Known Activations