INDEX
    Explanations

    mentions of names and proper nouns

    New Auto-Interp
    Negative Logits
     bleach
    -0.85
     Bleach
    -0.73
    ¡
    -0.73
    207
    -0.72
    idth
    -0.67
    Ble
    -0.64
     Beckham
    -0.64
     Chevron
    -0.64
    ected
    -0.64
    OUP
    -0.63
    POSITIVE LOGITS
    m
    1.41
    M
    1.25
    mic
    1.13
    MI
    1.10
    mt
    1.10
    mill
    1.08
    mob
    1.07
    MN
    1.06
    mo
    1.06
    Ms
    1.05
    Act Density 0.276%

    No Known Activations