INDEX
    Explanations

    independent

    New Auto-Interp
    Negative Logits
     Scor
    -0.08
     area
    -0.07
    496
    -0.07
     call
    -0.07
    umes
    -0.07
     Gaul
    -0.07
    oma
    -0.06
     sexual
    -0.06
     nấu
    -0.06
     Mostly
    -0.06
    POSITIVE LOGITS
     independent
    0.13
     Independent
    0.12
     Independence
    0.11
     independence
    0.11
    Independent
    0.11
     independ
    0.10
     independents
    0.10
    0.10
    Independ
    0.09
     independently
    0.09
    Act Density 0.015%

    No Known Activations