INDEX
    Explanations

    predicts human judgment

    New Auto-Interp
    Negative Logits
     signatories
    0.52
     internalization
    0.46
    ปิด
    0.46
     deklar
    0.45
     defocus
    0.45
    ന്ത്രാ
    0.45
    Mass
    0.44
     clásico
    0.44
     omnis
    0.44
    Encoding
    0.43
    POSITIVE LOGITS
     Gulf
    0.43
     trouble
    0.41
     ailing
    0.41
    々な
    0.40
     Trouble
    0.39
     Luffy
    0.39
     grapefruit
    0.38
     faulty
    0.38
     Lemon
    0.38
     Krankheit
    0.38
    Act Density 0.004%

    No Known Activations