INDEX
    Explanations

    instances of the second person pronoun "you"

    asking for definition or opinion

    indicators that the assistant/model is speaking or being directly addressed in a dialogue, such as speaker labels, turn markers, and second-person references.

    New Auto-Interp
    Negative Logits
    DeleteBehavior
    -0.52
    GraphicsUnit
    -0.51
     мәкал
    -0.50
    SharedCtor
    -0.49
     CanadaChoose
    -0.49
    書館
    -0.49
    Jeografia
    -0.47
    BagConstraints
    -0.47
    titleMargin
    -0.47
     EnglishChoose
    -0.47
    POSITIVE LOGITS
    ETHING
    0.35
    :][
    0.35
    zések
    0.34
    <!--
    
    0.33
    MLLoader
    0.31
    ftet
    0.31
    orei
    0.30
    zaak
    0.30
    ><!--
    0.29
    ="@
    0.29
    Act Density 0.000%

    No Known Activations