INDEX
    Explanations

    admit/grant

    New Auto-Interp
    Negative Logits
    instead
    -0.08
     दिशा
    -0.08
     clarified
    -0.08
     recommendations
    -0.08
     convictions
    -0.07
     اص
    -0.07
     reminders
    -0.07
    _READY
    -0.07
     निर्देश
    -0.07
     commandments
    -0.07
    POSITIVE LOGITS
     acknowledge
    0.10
     acknowledging
    0.10
     acknowledges
    0.09
     acknowledged
    0.09
    0.09
     imperfections
    0.09
     Vir
    0.08
     oefenen
    0.08
     인정
    0.08
     admitir
    0.08
    Act Density 0.038%

    No Known Activations