INDEX
    Explanations

    imperative verbs and phrases indicating requests or actions

    New Auto-Interp
    Negative Logits
    ssa
    -0.15
    itorio
    -0.15
    Ĩ
    -0.15
    ijn
    -0.14
    igure
    -0.14
    stvo
    -0.14
    ificaciones
    -0.14
    esson
    -0.14
    	fflush
    -0.13
    setattr
    -0.13
    POSITIVE LOGITS
    lias
    0.16
    kuk
    0.15
    ctor
    0.14
    zee
    0.14
    ILA
    0.14
    allas
    0.14
    ople
    0.14
     Gig
    0.14
    iams
    0.14
    cala
    0.14
    Act Density 0.030%

    No Known Activations