INDEX
    Explanations

    references to the Harry Potter series and its main actor, Daniel Radcliffe

    Negative Logits
    Token            Logit
    "NC"             -0.17
    "ammers"         -0.17
    "857"            -0.15
    "igli"           -0.15
    "UILTIN"         -0.15
    " Fut"           -0.14
    " Nationals"     -0.14
    " سنت"           -0.14
    "ruba"           -0.14
    "356"            -0.14
    Positive Logits
    Token            Logit
    " Hogwarts"      0.23
    " Harry"         0.22
    "Harry"          0.20
    " Rowling"       0.20
    " Voldemort"     0.19
    " Snape"         0.17
    "ucha"           0.17
    " HP"            0.16
    "umbledore"      0.16
    " spells"        0.16
    Activation Density 0.073%
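
    The activation density figure can be read as the share of token positions on which the feature fires. The sketch below is one plausible way to compute such a figure, assuming density means "fraction of positions with activation above a threshold"; the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and are not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. This is an illustrative definition, not the
    dashboard's confirmed method.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data only: 100,000 positions, 73 of them active. The counts are
# chosen so the result lines up with the 0.073% figure; they are not
# real activation data for this feature.
sample = [0.0] * 99_927 + [1.0] * 73
density = activation_density(sample)
print(f"Activation density: {density:.3%}")
```

    A density this low means the feature is silent on the vast majority of tokens, which fits a narrow, topic-specific feature.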

    No Known Activations