INDEX
    Explanations

    adjectives for different kinds of things

    terms indicating various categories or types of items or situations

    New Auto-Interp
    Negative Logits
    edia
    -0.95
     presidency
    -0.73
    ahime
    -0.72
    opsis
    -0.72
    ulhu
    -0.69
    eka
    -0.67
    aeper
    -0.67
    instein
    -0.67
     rapist
    -0.66
    gary
    -0.66
    POSITIVE LOGITS
     kinds
    0.77
     imaginable
    0.77
     goodies
    0.77
    hell
    0.77
    hots
    0.71
     varied
    0.71
     different
    0.69
    itionally
    0.69
    alities
    0.69
     things
    0.68
    Act Density 0.021%

    No Known Activations