INDEX
    Explanations

    the phrase "believe it or not."

    phrases that express doubt or disbelief

    New Auto-Interp
    Negative Logits
    istries
    -0.64
    culosis
    -0.59
     RH
    -0.58
     Grac
    -0.57
     Rollins
    -0.56
    nen
    -0.56
     HIP
    -0.56
    restling
    -0.55
     OU
    -0.55
    pload
    -0.55
    POSITIVE LOGITS
     versa
    0.83
     thereof
    0.83
    éĹ
    0.80
    .}
    0.70
    cffff
    0.67
    cffffcc
    0.65
    endif
    0.65
    ALSE
    0.64
     alike
    0.63
    depending
    0.63
    Act Density 0.097%

    No Known Activations