INDEX
    Explanations

    references to communities or communism

    Negative Logits
    ľ
    -2.38
    č↵č↵   
    -2.28
                                                                                  
    -2.28
    -2.28
    <|outofrange|>
    -2.28
    č↵č↵       
    -2.28
                                       
    -2.28
    ↵    ↵   
    -2.28
    -2.28
    -2.28
    Positive Logits
    icable      1.84
    ITED        1.84
    shot        1.73
    imental     1.72
    ICT         1.66
    ications    1.63
    ication     1.60
    ICES        1.59
    ision       1.52
    icator      1.51
    Act Density 0.245%

    No Known Activations