INDEX
    Explanations

    mentions of specific locations or organizations

    the indefinite article 'a' in various contexts

    New Auto-Interp
    Negative Logits
    agree
    -0.67
    oids
    -0.66
    everything
    -0.66
    onel
    -0.65
     admission
    -0.63
     excuse
    -0.62
    ody
    -0.62
     ANGEL
    -0.62
    Hour
    -0.62
     encour
    -0.62
    POSITIVE LOGITS
     defunct
    0.98
     prominent
    0.85
     variety
    0.84
     nearby
    0.84
     handful
    0.84
     prestigious
    0.83
     subsidiary
    0.83
     fictitious
    0.81
     dozen
    0.79
     local
    0.78
    Act Density 0.283%

    No Known Activations