INDEX
    Explanations

    references to Nazi-related terms and concepts

    references to the Nazi regime and its associated historical context

    Negative Logits
    pole            -0.78
    tis             -0.74
    Interstitial    -0.74
    pring           -0.73
    20439           -0.72
    Dub             -0.72
    notes           -0.71
    player          -0.68
    area            -0.67
    forward         -0.67
    Positive Logits
     Hitler         0.98
    ocaust          0.88
     Germany        0.87
    chwitz          0.86
     Youth          0.85
     salute         0.84
     Holocaust      0.84
    wald            0.80
     swast          0.79
     Nazi           0.79
    Act Density 0.058%

    No Known Activations