INDEX
    Explanations

    descriptive actions or states

    New Auto-Interp
    Negative Logits
     try
    0.44
    0.44
     always
    0.43
     obviously
    0.41
     যের
    0.41
    okus
    0.40
     அதிகமாக
    0.40
     Syk
    0.40
    try
    0.40
     rematch
    0.40
    POSITIVE LOGITS
     graced
    0.94
     donning
    0.69
     graces
    0.63
     basking
    0.61
     gleaming
    0.60
     crad
    0.60
     soaring
    0.59
     reverber
    0.56
     bustling
    0.55
     proudly
    0.53
    Act Density 0.009%

    No Known Activations