INDEX
    Explanations

    references to political leadership and election-related terminology

    Negative Logits
    Token                                    Logit
    "}}],"                                   -0.82
    "}))" (followed by a line break)         -0.73
    "ergies"                                 -0.72
    " Italijanski"                           -0.70
    "hyrchwyd"                               -0.68
    "MessageTagHelper"                       -0.68
    " utafitiHapana"                         -0.67
    "WSGI"                                   -0.67
    "HostException"                          -0.66
    " Affiliations"                          -0.66
    Positive Logits
    Token                                    Logit
    " chosen"                                0.86
    "Chosen"                                 0.81
    " general"                               0.78
    "select"                                 0.71
    " select"                                0.71
    " favored"                               0.71
    " selected"                              0.70
    " Chosen"                                0.69
    "chosen"                                 0.67
    " goal"                                  0.67
    Act Density 0.184%

    No Known Activations