INDEX
    Explanations

    the presence of the word "there" in various contexts

    New Auto-Interp
    Negative Logits
    ente
    -0.16
    rote
    -0.15
    enge
    -0.13
     e
    -0.13
    ÄĻ
    -0.13
    ECT
    -0.13
    erli
    -0.13
    ografie
    -0.13
     boards
    -0.13
    iris
    -0.13
    POSITIVE LOGITS
    's
    0.23
    ’s
    0.20
    İÅŀ
    0.19
    ain
    0.16
    eyin
    0.15
     simply
    0.15
     honestly
    0.14
     ya
    0.14
    xbd
    0.14
    İS
    0.14
    Act Density 0.087%

    No Known Activations