INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     preoperative
    0.46
     др
    0.43
     diarrhoea
    0.43
     enquire
    0.43
     economical
    0.42
     armament
    0.42
     προϊ
    0.42
     confidently
    0.42
    цаў
    0.42
     incidentally
    0.41
    POSITIVE LOGITS
     Add
    0.53
     الن
    0.50
     N
    0.45
     Diesel
    0.45
     kär
    0.44
     MediaPlayer
    0.44
     Magic
    0.43
     Scan
    0.43
     sbagli
    0.43
    Add
    0.43
    Act Density 0.001%

    No Known Activations