INDEX
    Explanations

    phrases related to design and intention in various contexts

    instances of the word "designed."

    New Auto-Interp
    Negative Logits
    dash
    -0.73
     Ezek
    -0.68
    enne
    -0.68
    nikov
    -0.66
    held
    -0.65
     Isa
    -0.64
    Charge
    -0.63
    rika
    -0.63
     Michaels
    -0.63
    hus
    -0.63
    POSITIVE LOGITS
    urally
    0.98
    ators
    0.82
     flaw
    0.76
     Parenthood
    0.76
    ural
    0.75
     layouts
    0.74
    ating
    0.72
    yout
    0.71
     designing
    0.70
    ĸļ
    0.70
    Act Density 0.029%

    No Known Activations