INDEX
    Explanations

    terms related to regulations and laws, including categories like race, gender, religion, and national origin

    items and characteristics that are typically listed or categorized

    New Auto-Interp
    Negative Logits
    herent
    -0.70
    ¬¼
    -0.69
    ame
    -0.64
    ername
    -0.61
    irlf
    -0.61
    icol
    -0.60
    iaries
    -0.59
     fres
    -0.58
    ocese
    -0.58
    ugi
    -0.57
    POSITIVE LOGITS
     etc
    1.04
     albeit
    0.95
    etc
    0.74
     namely
    0.69
     which
    0.69
    76561
    0.69
     however
    0.69
     whereas
    0.66
     although
    0.66
     but
    0.65
    Act Density 0.796%

    No Known Activations