INDEX
    Explanations

    phrases indicating proximity or position

    New Auto-Interp
    Negative Logits
    /gpl
    -0.15
    Henry
    -0.15
    usercontent
    -0.15
    mgr
    -0.15
    лав
    -0.15
    anter
    -0.14
     GPLv
    -0.14
    æĴ®
    -0.14
    antha
    -0.13
    uters
    -0.13
    POSITIVE LOGITS
    iline
    0.16
     nhau
    0.16
    ä¹İ
    0.15
    /on
    0.15
    íijľ
    0.15
     Gaines
    0.15
     -*-č↵
    0.14
    /about
    0.14
    orts
    0.14
    inder
    0.14
    Act Density 0.039%

    No Known Activations