INDEX
    Explanations

    references to historical political figures, particularly Joseph Stalin

    mentions of Joseph Stalin

    New Auto-Interp
    Negative Logits
    lihood
    -1.12
    es
    -0.84
    manship
    -0.79
    LEY
    -0.78
    wich
    -0.75
    git
    -0.73
    wall
    -0.70
     Indies
    -0.70
    chool
    -0.70
    verning
    -0.68
    POSITIVE LOGITS
    itri
    0.85
    éĹĺ
    0.85
    arily
    0.81
    ategory
    0.75
    apse
    0.74
    umenthal
    0.73
    istically
    0.70
    ascus
    0.69
    ossibility
    0.69
    emetery
    0.69
    Act Density 0.027%

    No Known Activations