INDEX
    Explanations

    specific nouns and dynamic verbs that indicate actions or processes related to structure and systems

    Negative Logits
    " soft"     -0.16
    "lesi"      -0.16
    " te"       -0.15
    ".orange"   -0.15
    " Dol"      -0.15
    " tic"      -0.14
    "ene"       -0.14
    " dap"      -0.14
    "inese"     -0.14
    "une"       -0.14
    Positive Logits
    "adir"          0.16
    "inton"         0.15
    "ubat"          0.14
    "asters"        0.14
    "omu"           0.14
    "aval"          0.14
    "trx"           0.14
    "imu"           0.14
    "rish"          0.13
    " Initialized"  0.13
    Act Density 0.019%

    No Known Activations