INDEX
    Explanations

    adjectives describing appearance or condition

    descriptive adjectives indicating quality or similarity

    Negative Logits
    "mental"         -0.75
    "ricular"        -0.75
    " Childhood"     -0.70
    "byter"          -0.69
    "shire"          -0.65
    "iculty"         -0.63
    "selling"        -0.61
    "igion"          -0.60
    "ritical"        -0.60
    " submission"    -0.59
    Positive Logits
    " lifeless"      0.79
    "bones"          0.79
    "bley"           0.79
    " suspic"        0.74
    " sleek"         0.73
    " ugly"          0.72
    " shiny"         0.70
    "pretty"         0.69
    " differently"   0.69
    " suspicious"    0.69
    Act Density 0.144%

    No Known Activations