INDEX
    Explanations

    references to historical figures and their impact on societal issues

    New Auto-Interp
    Negative Logits
    afari
    -0.15
    (íģ¬ê¸°
    -0.15
     Chow
    -0.15
    alendar
    -0.14
    ihilation
    -0.14
    æ®Ĭ
    -0.14
     å¾
    -0.14
    ormsg
    -0.14
    lament
    -0.14
    outil
    -0.14
    POSITIVE LOGITS
     Fol
    0.17
     Freel
    0.17
    fol
    0.16
    rek
    0.15
     Americans
    0.15
    123
    0.15
     AUD
    0.15
    258
    0.14
     folks
    0.14
    mal
    0.14
    Act Density 0.027%

    No Known Activations