INDEX
    Explanations

    the word "Roeder", as it appears multiple times with high activations

    words related to confederation or entities with "Feder" in them

    New Auto-Interp
    Negative Logits
     smartphones
    -0.65
     clarity
    -0.63
     summons
    -0.63
     Floor
    -0.62
     overtime
    -0.61
     satisfaction
    -0.59
     slowdown
    -0.59
     haze
    -0.59
     360
    -0.57
     gears
    -0.57
    POSITIVE LOGITS
    eder
    4.52
    ederation
    2.02
    eding
    1.56
    eds
    1.34
    ede
    1.30
    ederal
    1.23
     Feder
    1.22
     Confeder
    1.15
    edes
    1.09
    rer
    1.04
    Act Density 0.006%

    No Known Activations