INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    LabelTagHelper
    -0.43
     MacDonald
    -0.43
     ICS
    -0.43
     Saunders
    -0.43
     Browne
    -0.42
    Markus
    -0.41
     Markus
    -0.41
     φ
    -0.41
    nas
    -0.41
    in
    -0.40
    POSITIVE LOGITS
     Utah
    2.39
    Utah
    2.25
     utah
    1.90
    utah
    1.52
     UTA
    1.06
     Uta
    1.01
     Mormons
    0.88
     Mormon
    0.88
     Salt
    0.86
    Salt
    0.82
    Act Density 0.007%

    No Known Activations