INDEX
    Explanations

    phrases related to official statements or reports

    repeated instances of the word "the."

    Negative Logits
    "izons"      -0.73
    "illion"     -0.71
    "Its"        -0.70
    "OTA"        -0.70
    "ambo"       -0.67
    "oscope"     -0.67
    "spell"      -0.66
    "dale"       -0.66
    "rape"       -0.65
    "terday"     -0.65
    Positive Logits
    " easiest"      0.94
    " strongest"    0.73
    " impossible"   0.72
    " raining"      0.71
    " uphill"       0.71
    " safest"       0.71
    " customary"    0.69
    " usual"        0.68
    " hardest"      0.68
    " simplest"     0.66
    Activation Density: 0.140%

    No Known Activations