INDEX
    Explanations

    key names and terms related to societal commentary and critique

    New Auto-Interp
    Negative Logits
    odo
    -0.10
     understandably
    -0.08
    ãģłãĤįãģĨ
    -0.07
    ODO
    -0.06
     mÃ¼ÅŁ
    -0.06
     alone
    -0.06
    avenport
    -0.06
    ,...↵↵
    -0.06
    arga
    -0.06
    ноÑĩ
    -0.06
    POSITIVE LOGITS
     actually
    0.26
    actually
    0.22
     Actually
    0.21
    Actually
    0.19
     actual
    0.18
    å®ŀéĻħ
    0.17
     aslında
    0.15
    åħ¶å®ŀ
    0.15
    actual
    0.15
     reality
    0.15
    Act Density 0.008%

    No Known Activations