INDEX
    Explanations

    references to important familial and social relationships

    New Auto-Interp
    Negative Logits
    ebek
    -0.17
    kate
    -0.15
    ewire
    -0.15
    ramework
    -0.14
    uum
    -0.14
    ivery
    -0.14
    è¦
    -0.14
    nave
    -0.14
    efore
    -0.14
    mium
    -0.14
    POSITIVE LOGITS
     nor
    0.57
     Nor
    0.38
    nor
    0.37
    Nor
    0.34
     anymore
    0.34
     NOR
    0.29
     or
    0.24
     sondern
    0.20
     unless
    0.20
     necessarily
    0.20
    Act Density 0.145%

    No Known Activations