INDEX
    Explanations

    conditional outcomes or exceptions

    New Auto-Interp
    Negative Logits
     relieves
    0.45
    的作用
    0.41
     putem
    0.39
     collecting
    0.39
     longo
    0.38
     Maintain
    0.38
     tal
    0.37
    வதால்
    0.36
    संक
    0.36
     করিলাম
    0.36
    POSITIVE LOGITS
     inexplic
    0.52
     arbitrarily
    0.50
     отказыва
    0.49
     प्रीमियम
    0.48
     refus
    0.47
    差別
    0.47
     premium
    0.47
     mysteriously
    0.47
    premium
    0.46
     할인
    0.46
    Act Density 0.059%

    No Known Activations