    Explanations

    phrases or constructions indicating possession or ownership

    Negative Logits
    .gs           -0.17
     suggestion   -0.15
    ona           -0.15
    âĻ            -0.15
     Alexandria   -0.15
    erti          -0.14
    ypy           -0.14
    ãĥ³ãĤ¹        -0.14
    erah          -0.14
    nox           -0.14
    Positive Logits
    bilt          0.17
    æĴ®           0.15
    thane         0.15
     åĥ           0.15
    à¥ĭम          0.15
    Brun          0.14
    isson         0.14
    riott         0.14
    ieties        0.14
     Yates        0.13
    Act Density 0.015%

    No Known Activations