INDEX
    Explanations

    special characters or unusual symbols within the text

    New Auto-Interp
    Negative Logits
     Thur
    -0.14
    rál
    -0.14
    оÑģÑĥд
    -0.14
    UIL
    -0.14
    .mag
    -0.14
    ãĥ©ãĤ¹
    -0.14
    nout
    -0.14
    lient
    -0.14
     UIL
    -0.13
     dime
    -0.13
    POSITIVE LOGITS
     production
    0.28
    production
    0.23
     Production
    0.23
    -production
    0.21
    Production
    0.21
    _production
    0.20
     produ
    0.20
    produ
    0.20
    .production
    0.20
    çĶŁäº§
    0.19
    Act Density 0.010%

    No Known Activations