INDEX
    Explanations

    phrases indicating action, participation, or states of being

    New Auto-Interp
    Negative Logits
    iel
    -0.15
    ner
    -0.14
    https
    -0.14
     века
    -0.14
     https
    -0.14
    utch
    -0.14
    uba
    -0.13
    	goto
    -0.13
     concerns
    -0.13
    403
    -0.13
    POSITIVE LOGITS
     hopefully
    0.17
    sterol
    0.16
    ophe
    0.15
    TextStyle
    0.14
    .gdx
    0.14
    sembly
    0.14
    thew
    0.14
    leÅŁik
    0.14
    reesome
    0.14
    hopefully
    0.14
    Act Density 0.040%

    No Known Activations