INDEX
    Explanations

    mentions of the word "turkey."

    New Auto-Interp
    Negative Logits
    ardo
    -0.79
    iott
    -0.75
    iam
    -0.74
    ingly
    -0.72
     largeDownload
    -0.70
    ister
    -0.69
    ances
    -0.68
    orius
    -0.68
    esis
    -0.66
    inel
    -0.66
    POSITIVE LOGITS
    geon
    0.80
    geons
    0.80
    gie
    0.75
    STEM
    0.74
    Rex
    0.72
     Sabres
    0.71
    gery
    0.69
    BIL
    0.69
     Nug
    0.65
    pora
    0.64
    Act Density 0.029%

    No Known Activations