INDEX
    Explanations

    and followed by descriptors

    Negative Logits
    " devoid"          0.71
    " prone"           0.69
    " Doesn"           0.64
    "prone"            0.61
    " susceptibles"    0.61
    " nói"             0.60
    " ต้อง"            0.59
    " reminiscent"     0.59
    " capaces"         0.59
    " دارای"           0.59
    Positive Logits
    "-"                0.63
    " unwitting"       0.62
    "-]"               0.59
    "-)"               0.58
    " unexpected"      0.54
    " albeit"          0.53
    " sizable"         0.53
    "-" + U+200B       0.53
    " unanticipated"   0.52
    " admittedly"      0.52
    Act Density 0.583%

    No Known Activations