INDEX
    Explanations

    politically and legally charged words and phrases

    terms associated with issues, questions, and concepts related to problems or debates

    New Auto-Interp
    Negative Logits
    rower
    -0.74
    cale
    -0.69
    stem
    -0.68
    ync
    -0.64
    erver
    -0.64
     Observatory
    -0.63
    hirt
    -0.62
     Splash
    -0.61
    ource
    -0.60
    creen
    -0.60
    POSITIVE LOGITS
    lessly
    1.07
    less
    0.91
    ishly
    0.89
    ually
    0.89
    ally
    0.88
    ably
    0.87
    atical
    0.86
    ically
    0.84
    arily
    0.83
    ily
    0.81
    Act Density 0.381%

    No Known Activations