INDEX
    Explanations

    verbs and phrases associated with actions or transformations

    Negative Logits
    alsa        -0.19
    irl         -0.17
    ãģĸ         -0.17
    hle         -0.15
     Gregg      -0.15
    bak         -0.15
    íĦ´         -0.15
    UMENT       -0.15
    rale        -0.15
    bben        -0.14
    Positive Logits
    ighthouse   0.15
    amage       0.15
     Burton     0.15
    bish        0.15
     rein       0.14
    ̣           0.14
    ational     0.14
     Iron       0.14
    eya         0.14
    iesz        0.14
    Activation Density: 0.018%

    No Known Activations