INDEX
    Explanations

    references to the word "this" in various contexts

    New Auto-Interp
    Negative Logits
    DCF
    -0.16
    /includes
    -0.15
    reste
    -0.14
    ÎŃαÏĤ
    -0.14
     nackt
    -0.14
    aty
    -0.14
    issy
    -0.13
     Abr
    -0.13
    vero
    -0.13
    Į¨
    -0.13
    POSITIVE LOGITS
     gem
    0.19
     little
    0.17
     âĨĵ
    0.17
    :↵
    0.16
     воÑĤ
    0.15
    âĨĵ
    0.15
    inesis
    0.15
    šak
    0.15
     beaut
    0.14
     recent
    0.14
    Act Density 0.093%

    No Known Activations