INDEX
    Explanations

    references to familial concepts or relationships

    Negative Logits
    stav      -0.18
    .xhtml    -0.15
    yw        -0.15
    Mate      -0.15
    ĭ         -0.14
    yms       -0.14
    iffe      -0.14
    .beh      -0.14
    èģ        -0.14
    inia      -0.14
    Positive Logits
    iglia     0.33
    ÃŃlia     0.28
    iliar     0.24
    ili       0.24
    ously     0.23
    ished     0.22
    iger      0.21
    OUS       0.21
    igli      0.20
    ulus      0.20
    Activation Density: 0.004%

    No Known Activations