INDEX
    Explanations

    terms related to genetics and ethical considerations

    New Auto-Interp
    Negative Logits
     yet
    -0.24
    Yet
    -0.23
     Yet
    -0.22
     however
    -0.20
     HOWEVER
    -0.20
    yet
    -0.20
     ONLY
    -0.18
     However
    -0.17
    fi
    -0.17
     Though
    -0.16
    POSITIVE LOGITS
     sino
    0.36
     بÙĦÚ©Ùĩ
    0.34
     sondern
    0.32
     nor
    0.27
    also
    0.23
     also
    0.21
     Nor
    0.20
    ï¼Įä¹Ł
    0.19
    Nor
    0.19
    ï¼ĮèĢĮä¸Ķ
    0.19
    Act Density 0.067%

    No Known Activations