INDEX
    Explanations

    high quality expression

    New Auto-Interp
    Negative Logits
    Neither
    0.47
    neither
    0.44
    izzo
    0.44
    பின்னர்
    0.43
     occasional
    0.42
     neither
    0.41
    temporary
    0.41
     periodic
    0.41
    ちなみに
    0.40
    いくつかの
    0.39
    POSITIVE LOGITS
     extensively
    1.33
     بشكل
    1.17
     बखूबी
    1.02
     wholeheartedly
    1.01
     vividly
    0.97
     galore
    0.96
     admirably
    0.94
    ຢ່າງ
    0.94
     thoroughly
    0.91
     emphatically
    0.91
    Act Density 0.038%

    No Known Activations