INDEX
    Explanations

    proper nouns like names of individuals

    Negative Logits
    "stract"       -0.70
    "ADRA"         -0.70
    "ctory"        -0.68
    "ça"           -0.66
    "UGE"          -0.64
    "PDATE"        -0.64
    "nces"         -0.63
    " sympt"       -0.62
    " pregn"       -0.61
    " Greenpeace"  -0.61
    Positive Logits
    "sonian"       1.58
    "son"          0.93
    "smanship"     0.90
    "anity"        0.87
    "ies"          0.86
    "tein"         0.84
    " Barney"      0.84
    "inelli"       0.83
    "sburg"        0.83
    "gren"         0.81
    Activation Density: 7.762%

    No Known Activations