    NEGATIVE LOGITS
    Token                  Logit
    "GEBURTSDATUM"         -0.60
    " ویکی‌پدی"             -0.60
    " المعيارى"            -0.53
    "MigrationBuilder"     -0.46
    " صوتيه"               -0.46
    "хьтан"                -0.43
    "Tikang"               -0.43
    " ligiloj"             -0.41
    "pren"                 -0.40
    "dbach"                -0.39
    POSITIVE LOGITS
    Token                  Logit
    " same"                0.70
    " exact"               0.58
    " identical"           0.58
    " medesimo"            0.56
    " gleichen"            0.55
    " саме"                0.55
    " samego"              0.55
    "SAME"                 0.54
    " very"                0.54
    " SAME"                0.53
    Activation Density: 0.007%

    No Known Activations