INDEX
    Explanations

    references to historical and literary figures, particularly those associated with social commentary and critique

    New Auto-Interp
    Negative Logits
    Incoming
    -0.51
    erville
    -0.51
     lyder
    -0.50
    iterranean
    -0.49
    enf
    -0.49
    воло
    -0.47
    utsch
    -0.47
     slutt
    -0.46
    FontStyle
    -0.46
    oudou
    -0.45
    POSITIVE LOGITS
    цездатний
    0.77
    تقاوى
    0.75
     Italijani
    0.73
     disambiguazione
    0.72
    WebElementEntity
    0.71
     يتيمه
    0.70
    kegaard
    0.69
     estekak
    0.67
     Normdatei
    0.66
    LookAnd
    0.65
    Act Density 0.216%

    No Known Activations