INDEX
    Explanations

    questions or statements expressing curiosity or doubt

    questions expressing curiosity or inquiry about circumstances and outcomes

    New Auto-Interp
    Negative Logits
    catentry
    -0.83
    20439
    -0.77
    mouth
    -0.75
    alysed
    -0.74
    arget
    -0.69
    ongs
    -0.66
    aration
    -0.66
    interstitial
    -0.66
    idation
    -0.65
    idelines
    -0.65
    POSITIVE LOGITS
    xual
    0.85
     suspic
    0.80
     misunder
    0.79
     nostalg
    0.76
     why
    0.72
     fate
    0.71
     retribution
    0.70
     motives
    0.70
     millenn
    0.70
     explan
    0.70
    Act Density 0.108%

    No Known Activations