INDEX
    Explanations

    phrases indicating concern for the well-being of oneself and others

    phrases related to personal relationships and connections

    New Auto-Interp
    Negative Logits
    igun
    -0.65
    etting
    -0.64
    代
    -0.63
    geoning
    -0.62
    ettel
    -0.61
    Contin
    -0.59
     guiActiveUnfocused
    -0.59
    imal
    -0.59
    dayName
    -0.58
     Gleaming
    -0.58
    POSITIVE LOGITS
     others
    1.21
     yours
    1.07
     theirs
    0.93
     ours
    0.88
     everyone
    0.88
     your
    0.88
     anyone
    0.85
     our
    0.84
     my
    0.82
    your
    0.81
    Act Density 0.110%

    No Known Activations