INDEX
Explanations
references to seasons and episodes of television shows
New Auto-Interp
Negative Logits
decades
-0.23
centuries
-0.20
-century
-0.17
century
-0.16
alis
-0.16
trad
-0.16
longstanding
-0.15
Trad
-0.15
Cent
-0.14
Trad
-0.14
POSITIVE LOGITS
inaugural
0.32
second
0.26
第äºĮ
0.23
second
0.22
original
0.20
第äºĮ
0.20
SECOND
0.20
segunda
0.19
SECOND
0.19
ikinci
0.18
Activations Density 0.233%