INDEX
    Explanations

    words related to animals and their breeding

    Negative Logits
    abelle    -0.17
    ope       -0.17
    aneous    -0.17
    ose       -0.16
    opa       -0.15
    hod       -0.15
    BC        -0.15
    ase       -0.15
    ugs       -0.15
    arrow     -0.14
    Positive Logits
    jamin     0.27
    emer      0.19
    friend    0.19
    .gdx      0.19
    quets     0.19
    сят       0.18
    iful      0.18
     sexes    0.17
    elor      0.16
    lava      0.16
    Activation Density 2.403%

    No Known Activations