INDEX
    Explanations

    verbs that express command or caution

    New Auto-Interp
    Negative Logits
    OGND
    -0.69
     pylint
    -0.66
    requestData
    -0.62
    subsubsection
    -0.61
    pisah
    -0.61
    herself
    -0.58
    pira
    -0.57
    hingga
    -0.57
    gheny
    -0.56
    řel
    -0.56
    POSITIVE LOGITS
     Donny
    0.93
    Dont
    0.83
     Doy
    0.80
    TagHelper
    0.76
     Dont
    0.76
    dont
    0.75
     Jangan
    0.74
     beware
    0.73
    0.72
    Don
    0.72
    Act Density 0.069%

    No Known Activations