INDEX
    Explanations

    instances of the word "it" and its variations, indicating a focus on pronouns and their contextual usage

    New Auto-Interp
    Negative Logits
    olle
    -0.16
    elez
    -0.15
     RCA
    -0.15
    undler
    -0.15
    agn
    -0.14
    ebi
    -0.14
    nk
    -0.14
    .Modules
    -0.13
    unbind
    -0.13
     Platt
    -0.13
    POSITIVE LOGITS
     pivot
    0.14
    åħĦå¼Ł
    0.14
     Bid
    0.14
    Scaler
    0.14
    MQ
    0.14
    urma
    0.14
    dux
    0.13
     den
    0.13
    heiro
    0.13
    iggins
    0.13
    Act Density 0.850%

    No Known Activations