INDEX
    Explanations

    names of people and locations, particularly in political contexts

    Negative Logits
    ccording    -0.64
    shock       -0.63
    amazon      -0.63
    ecd         -0.60
    tackle      -0.59
    spin        -0.59
    agra        -0.59
    Leaks       -0.59
    duino       -0.59
     defic      -0.58
    Positive Logits
     ,      1.61
     .      1.50
     ;      1.44
     ).     1.42
     :      1.38
     .)     1.37
     )      1.36
     ),     1.35
     ._     1.32
     ,"     1.32
    Act Density 0.047%

    No Known Activations