INDEX
    Explanations

    words related to breaking down, analyzing, or deconstructing complex concepts or structures, such as systems, mechanisms, texts, and individuals

    New Auto-Interp
    Negative Logits
    arak
    -0.68
    mong
    -0.62
    nder
    -0.62
    nor
    -0.60
    jab
    -0.59
     recall
    -0.59
     Patron
    -0.58
    ught
    -0.58
     Promise
    -0.58
    visor
    -0.58
    POSITIVE LOGITS
     barriers
    0.86
    sheets
    0.83
    taining
    0.75
    stairs
    0.73
    shit
    0.71
    baugh
    0.69
    inately
    0.68
    casts
    0.67
    grades
    0.67
     stairs
    0.67
    Act Density 0.026%

    No Known Activations