INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _disc
    -0.08
     billions
    -0.07
     Leigh
    -0.07
     invis
    -0.07
     Wolver
    -0.06
    แส
    -0.06
    _multip
    -0.06
     پژوهش
    -0.06
    _REL
    -0.06
    _head
    -0.06
    POSITIVE LOGITS
     customs
    0.14
     Customs
    0.13
     customary
    0.10
     custom
    0.08
    common
    0.08
     accustomed
    0.07
    ustomed
    0.07
     обы
    0.07
    omatic
    0.07
     des
    0.07
    Act Density 0.003%

    No Known Activations