INDEX
    Explanations

    sentences indicating beliefs or convictions

    New Auto-Interp
    Negative Logits
    pin
    -0.76
     Summit
    -0.68
    IDER
    -0.68
     Palmer
    -0.67
    Interstitial
    -0.66
    PIN
    -0.65
     Gamma
    -0.65
    MIN
    -0.65
     Incarnation
    -0.64
     advis
    -0.64
    POSITIVE LOGITS
     they
    1.03
    they
    1.03
    she
    0.90
    erers
    0.86
    sbm
    0.81
     he
    0.81
    rained
    0.76
     we
    0.75
    ü
    0.73
    rists
    0.73
    Act Density 0.107%

    No Known Activations