INDEX
    Explanations

    references to new or fresh items or entities

    mentions of brands or brand-related terms

    New Auto-Interp
    Negative Logits
    ulhu
    -0.94
    pmwiki
    -0.87
    veyard
    -0.80
     Poverty
    -0.73
    abama
    -0.72
    SEE
    -0.71
    cale
    -0.69
    poons
    -0.69
     Able
    -0.69
    vae
    -0.69
    POSITIVE LOGITS
    ishing
    0.98
     loyalty
    0.96
    enburg
    0.87
    ished
    0.86
    brand
    0.82
    ages
    0.79
    Brand
    0.77
    aging
    0.77
    ishes
    0.77
     brand
    0.72
    Act Density 0.016%

    No Known Activations