INDEX
    Explanations

    references to unique or specific categories or types within a context

    New Auto-Interp
    Negative Logits
    annel
    -0.17
     pau
    -0.16
    erva
    -0.15
     Kür
    -0.15
     Äijây
    -0.15
    ipp
    -0.14
    porto
    -0.14
    ÃŃm
    -0.14
    -âĢIJ
    -0.14
    LETE
    -0.13
    POSITIVE LOGITS
     its
    0.39
    its
    0.30
     Its
    0.29
    åħ¶
    0.28
    Its
    0.27
     his
    0.26
     seus
    0.25
     sua
    0.25
     their
    0.24
     seu
    0.24
    Act Density 0.140%

    No Known Activations