INDEX
    Explanations

    phrases related to relationships and accountability

    Negative Logits
    Token         Logit
    "cium"        -0.67
    "Contin"      -0.67
    "AX"          -0.66
    "oids"        -0.59
    "enium"       -0.59
    "otropic"     -0.59
    "ombat"       -0.58
    "ibu"         -0.57
    "utt"         -0.57
    "iband"       -0.56
    Positive Logits
    Token         Logit
    " theirs"     1.86
    " hers"       1.67
    " yours"      1.51
    " ours"       1.49
    " mine"       1.31
    "his"         1.25
    "their"       1.22
    "your"        1.18
    " his"        1.17
    " your"       1.12
    Act Density 3.047%

    No Known Activations