INDEX
    Explanations

    phrases related to unity and community

    New Auto-Interp
    Negative Logits
     gratification
    -0.77
     quo
    -0.66
     totality
    -0.65
     UD
    -0.64
     citation
    -0.63
     anonymity
    -0.59
     LSD
    -0.59
     Publication
    -0.58
     srfAttach
    -0.56
     citations
    -0.56
    POSITIVE LOGITS
    bsite
    1.10
    bley
    1.10
    akening
    1.05
    athered
    1.05
    ird
    1.05
    ldon
    1.04
    asel
    1.01
    eks
    1.01
    aving
    1.00
    igh
    0.99
    Act Density 8.972%

    No Known Activations