INDEX
    Explanations

    content that challenges societal norms or beliefs regarding race and history

    New Auto-Interp
    Negative Logits
     purpoſe
    -0.58
     ſtate
    -0.55
     houſe
    -0.55
     XE
    -0.55
     eſt
    -0.55
     ſch
    -0.54
    ſelves
    -0.53
     efe
    -0.52
     pleaſure
    -0.51
     CreateTagHelper
    -0.51
    POSITIVE LOGITS
    RegressionTest
    0.72
    ChromeDriver
    0.70
    aarrggbb
    0.68
    EDEFAULT
    0.65
    bootstrapcdn
    0.56
     [*]
    0.55
    0.50
     jScrollPane
    0.49
     somehow
    0.49
     pinulongan
    0.47
    Act Density 0.469%

    No Known Activations