INDEX
    Explanations

    Auxiliary verbs

    New Auto-Interp
    Negative Logits
    ละ
    -0.07
    Required
    -0.06
     Radical
    -0.06
    fold
    -0.06
    iang
    -0.06
    Fun
    -0.06
    ąż
    -0.06
    hof
    -0.06
     unofficial
    -0.06
     Whats
    -0.06
    POSITIVE LOGITS
     pem
    0.07
     jealousy
    0.06
    ा:
    0.06
    _PIX
    0.06
    ,,,
    0.06
    -inst
    0.06
    ESIS
    0.06
    -prom
    0.06
     cara
    0.06
    0.06
    Act Density 0.087%

    No Known Activations