INDEX
    Explanations

    instances of the word "that" and its various forms, indicating a focus on explanatory or defining statements

    New Auto-Interp
    Negative Logits
    .datas
    -0.15
    ãĥ«ãĤ¯
    -0.15
    è¿Ļä¹Ī
    -0.15
    irsch
    -0.14
    ä
    -0.14
    ÑĪев
    -0.14
    legate
    -0.14
    imer
    -0.14
    arpa
    -0.14
    алеж
    -0.14
    POSITIVE LOGITS
    urdu
    0.15
    ceed
    0.15
    оÑĢод
    0.14
    itel
    0.14
    257
    0.13
    andır
    0.13
    ango
    0.13
    /reset
    0.13
     whole
    0.13
    ohl
    0.13
    Act Density 0.077%

    No Known Activations