INDEX
    Explanations

    politics and government

    New Auto-Interp
    Negative Logits
    .literal
    -0.06
    ustralian
    -0.06
    Laughs
    -0.06
    .Volume
    -0.06
    Designer
    -0.06
     Vaugh
    -0.06
    op
    -0.06
    ушка
    -0.06
     jednou
    -0.06
    -0.06
    POSITIVE LOGITS
    jmu
    0.07
    =#
    0.07
     Homo
    0.07
     SITE
    0.07
    ाइक
    0.06
    .ak
    0.06
    кій
    0.06
    ustry
    0.06
    	std
    0.06
    posure
    0.06
    Act Density 0.077%

    No Known Activations