INDEX
    Explanations

    proper nouns in various languages related to locations

    words related to healthcare and medical conditions

    Negative Logits
    Token                Logit
    "Analysis"           -0.68
    "Fair"               -0.67
    "ELF"                -0.64
    "Making"             -0.64
    "Inc"                -0.63
    " Accountability"    -0.61
    "Shock"              -0.61
    "Full"               -0.60
    "Notice"             -0.60
    "Trigger"            -0.59
    Positive Logits
    Token       Logit
    " pione"    0.84
    "pta"       0.84
    " kan"      0.80
    " mi"       0.77
    "jet"       0.75
    "iage"      0.74
    "inem"      0.74
    " gust"     0.73
    " vi"       0.73
    " tro"      0.73
    Act Density 0.125%

    No Known Activations