INDEX
    Explanations

    proper nouns, such as names of individuals, locations, and organizations

    Negative Logits
    Token           Logit
    " weave"        -0.57
    " gauge"        -0.55
    "icably"        -0.55
    " Gandhi"       -0.55
    "icable"        -0.55
    " oppressed"    -0.54
    " ($)"          -0.53
    " KKK"          -0.53
    "1800"          -0.53
    " Ethiop"       -0.53
    Positive Logits
    Token           Logit
    "idon"          1.03
    "EStream"       0.84
    " Soup"         0.76
    "stein"         0.76
    "ornia"         0.76
    "Fly"           0.75
    "usky"          0.74
    "cone"          0.74
    "antine"        0.72
    "glers"         0.71
    Activation Density: 5.946%

    No Known Activations