INDEX
    Explanations

    detailed descriptions or explanations

    statements about withholding information or not providing details

    New Auto-Interp
    Negative Logits
    ourses
    -0.62
    hiba
    -0.62
     reperto
    -0.61
     manag
    -0.59
    10000
    -0.58
    fif
    -0.57
    pes
    -0.57
    footed
    -0.56
    few
    -0.55
    thirds
    -0.54
    POSITIVE LOGITS
     anymore
    1.22
     myself
    0.94
     specifics
    0.94
     nor
    0.90
     anything
    0.85
     necessarily
    0.83
     ANY
    0.82
     anybody
    0.79
     spoilers
    0.79
     any
    0.78
    Act Density 0.399%

    No Known Activations