INDEX
Explanations
references to the Harry Potter series
references to the "Harry Potter" series and its characters
New Auto-Interp
Negative Logits
horizont
-0.64
Rockefeller
-0.63
resp
-0.62
oples
-0.62
bay
-0.60
vre
-0.60
asio
-0.60
WARD
-0.59
Low
-0.59
IGN
-0.59
POSITIVE LOGITS
Potter
1.01
haus
0.86
Dumbledore
0.85
wra
0.84
more
0.81
Rings
0.79
Gand
0.78
olkien
0.77
ebook
0.76
Granger
0.76
Activations Density 0.063%