INDEX
    Explanations

    occurrences of text encoding characters

    words or phrases related to perceptions and interpretations, particularly in the context of societal issues

    New Auto-Interp
    Negative Logits
     misunder
    -0.71
     JPEG
    -0.70
     decomp
    -0.64
     unofficial
    -0.63
     advis
    -0.62
     Mobil
    -0.62
     cellphone
    -0.62
     unidentified
    -0.61
     conspicuous
    -0.61
     retirees
    -0.61
    POSITIVE LOGITS
    s
    1.32
    tion
    1.05
    _.
    1.04
    til
    1.03
    tis
    1.00
    sit
    0.99
    sis
    0.98
    ski
    0.96
    swer
    0.95
    cause
    0.95
    Act Density 0.223%

    No Known Activations