INDEX
    Explanations

    various forms of the verb "blow."

    New Auto-Interp
    Negative Logits
    ussia
    -0.15
    ousse
    -0.15
    mes
    -0.15
    ummings
    -0.15
     vic
    -0.15
    osy
    -0.14
    ılıç
    -0.14
    oret
    -0.14
    uzzle
    -0.14
    LineStyle
    -0.14
    POSITIVE LOGITS
     blew
    0.22
     kisses
    0.20
    torch
    0.20
     blown
    0.19
     Blow
    0.19
     blow
    0.19
     Fuse
    0.19
     apart
    0.18
     blowing
    0.18
     Kiss
    0.17
    Act Density 0.011%

    No Known Activations