INDEX
    Explanations

    discussions surrounding accountability and the consequences of actions or policies

    New Auto-Interp
    Negative Logits
    _buffers
    -0.13
    plusplus
    -0.13
    .PerformLayout
    -0.13
    олÑĮз
    -0.13
    :CGRect
    -0.13
    åŃĺäºİ
    -0.12
    Ñĩе
    -0.12
     accompagn
    -0.12
    ambiguous
    -0.12
    .sponge
    -0.12
    POSITIVE LOGITS
     shows
    0.41
     indicates
    0.36
    shows
    0.35
     demonstrates
    0.35
     show
    0.34
     indicate
    0.34
     demonstrate
    0.34
     Shows
    0.34
    Shows
    0.31
     highlights
    0.30
    Act Density 0.789%

    No Known Activations