INDEX
    Explanations

    adjectives and adverbs indicating quality or effectiveness

    New Auto-Interp
    Negative Logits
    increments
    -0.17
    Slinky
    -0.16
     {{--<
    -0.15
    ivec
    -0.15
     erotique
    -0.14
     addCriterion
    -0.14
    .FC
    -0.14
    orang
    -0.14
    amac
    -0.14
    .fc
    -0.14
    POSITIVE LOGITS
     ast
    0.24
     has
    0.22
    has
    0.21
     ad
    0.20
     quanto
    0.19
     than
    0.19
     s
    0.18
     anybody
    0.18
     ae
    0.17
    HAS
    0.17
    Act Density 0.044%

    No Known Activations