INDEX
    Explanations

    instances of the word "split" and its variations

    New Auto-Interp
    Negative Logits
    het
    -0.17
    ensen
    -0.16
    WS
    -0.16
    hay
    -0.15
    416
    -0.15
    erli
    -0.15
    sis
    -0.15
    urement
    -0.15
     Pace
    -0.15
    hm
    -0.15
    POSITIVE LOGITS
    ting
    0.37
    tered
    0.30
    tering
    0.26
    TING
    0.25
    ter
    0.25
    .Split
    0.23
    (split
    0.22
     apart
    0.22
    tring
    0.22
    TERS
    0.21
    Act Density 0.021%

    No Known Activations