INDEX
    Explanations

    topics related to social and political issues

    New Auto-Interp
    Negative Logits
     with
    -0.42
     vỼi
    -0.35
    with
    -0.35
     dengan
    -0.34
    	with
    -0.32
    swith
    -0.30
     avec
    -0.29
    ewith
    -0.27
     withString
    -0.27
    _with
    -0.26
    POSITIVE LOGITS
     intact
    0.32
     having
    0.29
     being
    0.27
     thrown
    0.26
     included
    0.24
     remaining
    0.23
    having
    0.23
    being
    0.23
     added
    0.22
     sendo
    0.21
    Act Density 0.513%

    No Known Activations