INDEX
    Explanations

    sentences that emphasize unity and common ground among individuals, often using the context of discussing differences in background, beliefs, or preferences

    New Auto-Interp
    Negative Logits
    umbnail
    -0.68
    ociate
    -0.65
    luaj
    -0.64
    iling
    -0.63
    76561
    -0.63
    ertodd
    -0.62
    izons
    -0.59
    rongh
    -0.59
    ukong
    -0.59
    nel
    -0.58
    POSITIVE LOGITS
     raining
    1.27
     unclear
    1.26
     impossible
    1.19
     imperative
    1.14
     easier
    1.11
     ironic
    1.07
     doubtful
    1.05
     advisable
    1.05
     easy
    1.04
     conceivable
    1.03
    Act Density 3.005%

    No Known Activations