INDEX
Explanations
references to the television series "Doctor Who" and associated character names
New Auto-Interp
Negative Logits
egin
-0.17
Formation
-0.15
342
-0.14
rita
-0.14
jee
-0.14
formation
-0.14
æµģ
-0.14
lfw
-0.14
stry
-0.14
awan
-0.13
POSITIVE LOGITS
vais
0.17
agos
0.16
Lâm
0.14
enis
0.14
лоÑĢ
0.14
orst
0.13
обла
0.13
_Do
0.13
enheim
0.13
ê·¹
0.13
Activations Density 0.034%