INDEX
    Explanations

    proper nouns or names, specifically focused on the name "Troy"

    mentions of the word "Troy."

    Negative Logits
    INESS     -0.81
    awar      -0.75
    pmwiki    -0.73
    iquid     -0.72
    ãĥĥãĤ¯    -0.70
    ubric     -0.69
    ivia      -0.68
     pity     -0.68
    merga     -0.67
    ivities   -0.67

    Positive Logits
    alty      0.91
    alties    0.90
    ij士       0.81
    neau      0.79
    al        0.77
    imet      0.75
    don       0.75
    tics      0.74
    er        0.73
    nce       0.72

    Activation Density: 0.032%

    No Known Activations