INDEX
    Explanations

    comparisons between different items

    Negative Logits
     தடை    0.46
     отсут    0.45
    ebabkan    0.43
    超时    0.42
     其实    0.42
    atthena    0.41
    iasco    0.40
    传统    0.40
     Weltkrieg    0.40
     ක්ර    0.39
    Positive Logits
     identical    0.68
     different    0.66
     разных    0.58
    identical    0.57
     identically    0.56
     diferentes    0.55
     unterschied    0.55
     individuales    0.55
     similares    0.55
     разные    0.55
    Act Density 0.130%

    No Known Activations