INDEX
    Explanations

    sentences expressing concerns about political integrity and foreign interference in elections

    New Auto-Interp
    Negative Logits
    wußt
    -0.72
    BlockPos
    -0.67
     Barbier
    -0.59
    íts
    -0.58
    hofen
    -0.57
    ffilm
    -0.56
    Спољашње
    -0.56
    kün
    -0.56
    ughter
    -0.55
    tagHelperRunner
    -0.55
    POSITIVE LOGITS
     myſelf
    0.54
    发表于
    0.53
     <<-
    0.53
    ModelSerializer
    0.51
     becauſe
    0.51
    FunctionFlags
    0.50
     Monfieur
    0.49
     myself
    0.47
     NLI
    0.46
     raiſ
    0.46
    Act Density 0.285%

    No Known Activations