INDEX
    Explanations

    words related to temporal concepts and transitions

    New Auto-Interp
    Negative Logits
    ÙĬدة
    -0.15
    ustos
    -0.15
    ENTA
    -0.15
    ein
    -0.14
    ughs
    -0.14
    úa
    -0.14
    عÙħ
    -0.14
    OOK
    -0.14
    áp
    -0.14
    pike
    -0.13
    POSITIVE LOGITS
    ness
    0.21
    heid
    0.20
    çļĦæĺ¯
    0.17
    nr
    0.16
    IDAD
    0.15
    keiten
    0.15
    ITIES
    0.15
    ität
    0.15
    igkeit
    0.15
    'T
    0.14
    Act Density 0.047%

    No Known Activations