    Explanations

    names of specific individuals

    mentions of specific individuals, particularly focusing on the name "Zimmer."

    Negative Logits
    govtrack  -0.74
    seeking   -0.69
    obbies    -0.69
    esting    -0.69
    ulton     -0.65
    uckland   -0.62
    isan      -0.62
    luent     -0.61
    reating   -0.61
    ridges    -0.61
    Positive Logits
     Zimmer   1.01
    mann      0.84
    mans      0.81
    mented    0.78
    lins      0.75
    lin       0.73
    schild    0.72
    wald      0.67
    mens      0.67
    ngth      0.66
    Activation Density: 0.014%

    No Known Activations