INDEX
    Explanations

    mentions of the word "who" in various contexts

    New Auto-Interp
    Negative Logits
    ardown
    -0.15
    illes
    -0.15
    yen
    -0.15
    å¼ķãģį
    -0.14
    gaard
    -0.14
    å¹ķ
    -0.13
    ιÏĥ
    -0.13
    ÑĢазд
    -0.13
    èĦļ
    -0.13
    SURE
    -0.13
    POSITIVE LOGITS
    oping
    0.17
    upon
    0.16
    akin
    0.15
    endale
    0.15
    tridge
    0.14
    akes
    0.14
    osh
    0.14
    annel
    0.14
    воÑĢ
    0.13
     sinc
    0.13
    Act Density 0.122%

    No Known Activations