INDEX
    Explanations

    phrases indicating a need for improvement or growth

    Negative Logits
    ava                 -0.15
    ห                   -0.14
     familiar           -0.14
     fist               -0.14
     mn                 -0.14
    odore               -0.14
     attempt            -0.14
     Fro                -0.14
    dir                 -0.13
    261                 -0.13

    Positive Logits
    #                   0.18
    iola                0.16
    berger              0.15
    UsageId             0.15
    SEQUENTIAL          0.15
    iyel                0.15
    rieve               0.15
    ÐĴС                 0.15
    ocos                0.15
    OptionsResolver     0.14
    Activation Density: 0.162%

    No Known Activations