    Explanations

    expressions of degree or intensity

    Negative Logits
    ennen          -0.15
    inbox          -0.14
    .engine        -0.14
    oÄį            -0.14
    ilmington      -0.14
     Marketable    -0.14
    acz            -0.14
     Hicks         -0.14
    rlen           -0.13
    esco           -0.13
    Positive Logits
    AVA            0.14
    κι             0.14
    Invoker        0.14
    apur           0.14
    Forgery        0.14
    å°½            0.14
    Lights         0.14
    REW            0.13
    dojo           0.13
    eration        0.13
    Activation Density: 0.005%

    No Known Activations