INDEX
    Explanations

    verbs and their various forms

    Negative Logits
    FFFFFFFF    -0.17
    parable     -0.16
    zar         -0.16
    ÑĦоÑĢ        -0.16
    ypse        -0.16
    odge        -0.15
    antz        -0.14
    ombine      -0.14
    iphy        -0.14
    952         -0.14
    Positive Logits
    inded       0.15
     rab        0.14
     Wich       0.14
     neckline   0.14
    WithTitle   0.14
    ample       0.14
     Unit       0.14
     ÄĮech      0.14
     scales     0.14
     Stephen    0.14
    Activation Density: 0.001%

    No Known Activations