INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     proving
    0.75
    으나
    0.73
     намере
    0.71
     كور
    0.69
    intention
    0.67
     intention
    0.67
     ஏற்படுகிறது
    0.67
     مؤثر
    0.66
     intención
    0.65
     mentoring
    0.64
    POSITIVE LOGITS
     familiar
    2.66
     Familiar
    2.25
    familiar
    2.19
     familiarity
    2.03
    熟悉
    1.82
     familiarize
    1.77
    熟悉的
    1.73
    amiliar
    1.68
     acquainted
    1.62
     знако
    1.52
    Act Density 0.268%

    No Known Activations