INDEX
    Explanations

    names of political figures

    proper nouns and names, particularly related to individuals and entities

    Negative Logits
     Invention    -0.67
    olulu    -0.66
    cffffcc    -0.65
    retty    -0.62
    ŃĶ    -0.59
    ļéĨĴ    -0.59
    };    -0.59
     Gloria    -0.56
    .�    -0.56
     Palest    -0.55
    Positive Logits
     will    0.97
     intends    0.93
     could    0.93
     somehow    0.92
     might    0.89
     should    0.89
     someday    0.88
     would    0.88
     qualifies    0.86
     succeeds    0.85
    Activation Density: 0.479%

    No Known Activations