Explanations
references to the Harry Potter series and its main actor, Daniel Radcliffe
Negative Logits
Token       Logit
NC          -0.17
ammers      -0.17
857         -0.15
igli        -0.15
UILTIN      -0.15
Fut         -0.14
Nationals   -0.14
سÙĨت        -0.14
ruba        -0.14
356         -0.14
Positive Logits
Token       Logit
Hogwarts    0.23
Harry       0.22
Harry       0.20
Rowling     0.20
Voldemort   0.19
Snape       0.17
ucha        0.17
HP          0.16
umbledore   0.16
spells      0.16
Activations Density: 0.073%