INDEX
    Explanations

    instances where the text suggests questioning or pondering a certain topic

    conditional phrases that express curiosity or uncertainty

    Negative Logits
    mouth       -0.76
    oven        -0.70
    adra        -0.69
    along       -0.66
    alt         -0.65
    gate        -0.64
    back        -0.64
    ãĤ«         -0.64
    together    -0.64
    wash        -0.63
    Positive Logits
     finer           0.71
     suspic          0.67
     TAMADRA         0.65
     morality        0.64
     Grizz           0.64
     possible        0.64
     Juven           0.63
    /+               0.62
     "#              0.59
     homosexuality   0.58
    Act Density 0.149%

    No Known Activations