INDEX
Explanations
words ending with '-ator' or '-iator'
words related to various roles, positions, and titles associated with authorship or agency
New Auto-Interp
Negative Logits
ITNESS
-0.83
lyak
-0.75
erton
-0.68
lez
-0.68
erest
-0.67
OVER
-0.66
soDeliveryDate
-0.64
MER
-0.62
ness
-0.61
endor
-0.61
POSITIVE LOGITS
ially
1.37
ium
1.21
ial
1.16
SHIP
1.16
hips
1.09
IAL
1.01
IUM
0.95
io
0.91
ials
0.90
iate
0.88
Activations Density 0.143%