    Negative Logits
    Token              Logit
    " counselor"       -0.34
    "riever"           -0.30
    " SwitchCompat"    -0.29
    " nonprofit"       -0.28
    " counseling"      -0.28
    "."                -0.28
    " favor"           -0.28
    " counselors"      -0.27
    " labor"           -0.27
    " Counselor"       -0.26
    Positive Logits
    Token              Logit
    " AssemblyTitle"    0.80
    "bibfield"          0.76
    "BibitemShut"       0.76
    "ulongan"           0.71
    " perchè"           0.71
    " poichè"           0.70
    " ligiloj"          0.69
                        0.68
    " Honourable"       0.65
    " Amongst"          0.65
    Act Density 0.003%

    No Known Activations