INDEX
    Explanations

    a specific type of word occuring before proper nouns or short phrases

    instances of articles that indicate specific subjects or entities being introduced in a description

    New Auto-Interp
    Negative Logits
    enance
    -0.83
    imentary
    -0.75
    aos
    -0.74
    encies
    -0.72
    anism
    -0.69
    SIGN
    -0.68
    eye
    -0.67
    Contents
    -0.67
    ophob
    -0.65
    advertisement
    -0.65
    POSITIVE LOGITS
     successor
    0.93
     precursor
    0.92
     remnant
    0.86
     thinly
    0.85
     small
    0.85
     reference
    0.85
    hem
    0.84
     longtime
    0.84
     powerful
    0.83
     member
    0.82
    Act Density 0.138%

    No Known Activations