Explanations
references to authorship and author-related terms
Negative Logits
Token       Logit
yor         -0.17
-eyed       -0.16
eyes        -0.16
ey          -0.15
βά          -0.15
ow          -0.15
125         -0.14
-haired     -0.14
ingly       -0.14
eyed        -0.14
Positive Logits
Token       Logit
ship        0.17
UPPORTED    0.17
ãĤ¹ãĥ¬      0.17
lient       0.16
avia        0.16
YSTEM       0.14
iyet        0.14
Scheduled   0.14
upported    0.13
ìĭŃ         0.13
Activations Density: 0.027%