INDEX
    Explanations

    phrases related to official statements, actions, or events

    New Auto-Interp
    Negative Logits
     unspeak
    -0.76
     maneu
    -0.69
     FFFF
    -0.65
     outlander
    -0.64
     impra
    -0.63
     ACKNOWLEDGMENTS
    -0.63
     snoopy
    -0.63
     disagre
    -0.61
     vincent
    -0.61
     indescri
    -0.61
    POSITIVE LOGITS
    been
    0.87
     BEEN
    0.79
    Been
    0.79
     kayo
    0.76
     intende
    0.76
     been
    0.74
     \%$\\
    0.65
    ISHOP
    0.64
     Muhamma
    0.64
    Faites
    0.63
    Act Density 0.207%

    No Known Activations