INDEX
    Explanations

    phrases that express comparison or emphasize the significance of something beyond its surface value

    Negative Logits
     non
    -0.06
    sert
    -0.06
    steen
    -0.06
    ast
    -0.06
    formats
    -0.06
     Pony
    -0.05
    urses
    -0.05
    lyn
    -0.05
    izada
    -0.05
    ez
    -0.05
    Positive Logits
     merely
    0.13
     mere
    0.12
    mere
    0.11
     просто
    0.10
     Simply
    0.09
     simply
    0.09
     simplement
    0.09
    只是
    0.08
    ForObject
    0.07
    Simply
    0.07
    Act Density 0.020%

    No Known Activations