INDEX
    Explanations

    content related to political figures and policies

    New Auto-Interp
    Negative Logits
     Canaver
    -0.53
    estern
    -0.46
    odcast
    -0.38
    Picture
    -0.36
     PARK
    -0.36
    podcast
    -0.35
     Patreon
    -0.35
     Anonymous
    -0.35
     âĢº
    -0.34
     Geek
    -0.34
    POSITIVE LOGITS
    .).
    0.80
    )).
    0.77
    ?).
    0.70
    ).[
    0.70
    )."
    0.68
    ]."
    0.67
    ).
    0.67
    }.
    0.65
    ]).
    0.62
    %).
    0.61
    Act Density 18.821%

    No Known Activations