INDEX
    Explanations

    surprising or unexpected events or outcomes

    New Auto-Interp
    Negative Logits
    tein
    -0.87
    nan
    -0.86
    folios
    -0.84
    asus
    -0.82
    agra
    -0.81
    oreal
    -0.80
    ©¶æ
    -0.79
    odynam
    -0.78
    haps
    -0.75
    uel
    -0.74
    POSITIVE LOGITS
     surprise
    0.83
     surprises
    0.82
     guests
    0.80
     Flavoring
    0.78
     visitor
    0.78
     absor
    0.77
     Surprise
    0.77
    ingly
    0.75
     Squid
    0.70
     Pew
    0.68
    Act Density 0.036%

    No Known Activations