INDEX
    Explanations

    the word "turn" followed by a number indicating a significant action or change

    New Auto-Interp
    Negative Logits
    è¦ļéĨĴ
    -1.05
    ropolitan
    -0.92
    capacity
    -0.82
    mma
    -0.80
    inately
    -0.78
    ording
    -0.76
    foundation
    -0.72
    bour
    -0.71
    llah
    -0.71
    lain
    -0.69
    POSITIVE LOGITS
     awa
    0.87
     into
    0.85
    coat
    0.81
     shif
    0.77
    agra
    0.76
    around
    0.76
     inward
    0.75
     crank
    0.75
    about
    0.75
     sour
    0.74
    Act Density 8.651%

    No Known Activations