INDEX
    Explanations

    words and phrases related to design and design elements

    New Auto-Interp
    Negative Logits
    tes
    -0.17
    ikk
    -0.17
    ppy
    -0.17
    iker
    -0.17
    ken
    -0.15
    isen
    -0.15
    king
    -0.14
    unami
    -0.14
    izen
    -0.14
     Pf
    -0.14
    POSITIVE LOGITS
    å¸Ī
    0.18
    -build
    0.18
    ates
    0.18
    /design
    0.17
    eated
    0.16
    ees
    0.15
    adamente
    0.15
    /pl
    0.15
    filt
    0.15
    ated
    0.14
    Act Density 0.064%

    No Known Activations