INDEX
    Explanations

    references to prominent individuals or organizations in political contexts

    expressions of capability and perceived actions

    Negative Logits
    Token          Logit
    "Become"       -1.03
    " Become"      -1.01
    " Becoming"    -0.99
    "become"       -0.97
    "Becoming"     -0.96
    " becoming"    -0.96
    " become"      -0.95
    " becomes"     -0.94
    " Becomes"     -0.93
    "becoming"     -0.90
    Positive Logits
    Token            Logit
    " put"           0.67
    " assign"        0.65
    " introduce"     0.60
    " introducing"   0.58
    "oa̍t"            0.57
                     0.56
    " entrust"       0.55
    " treating"      0.55
    " allocate"      0.54
    " 把"            0.54
    Activation Density: 2.266%

    No Known Activations