INDEX
    Explanations

    patterns of consistency and similarity in systems or products

    New Auto-Interp
    Negative Logits
     separate
    -0.15
    à¹Ģà¸Ł
    -0.14
    jac
    -0.14
     Beng
    -0.14
    ipa
    -0.14
     separately
    -0.14
     Beard
    -0.13
    ew
    -0.13
     double
    -0.13
    ance
    -0.13
    POSITIVE LOGITS
     identical
    0.40
     uniform
    0.40
    缸åIJĮ
    0.32
    Uniform
    0.32
    uniform
    0.31
     Uniform
    0.30
     ident
    0.28
     ëıĻìĿ¼
    0.27
    _same
    0.26
     alike
    0.25
    Act Density 0.261%

    No Known Activations