INDEX
    Explanations

    dates written in a specific format (full weekday, month, day, year)

    New Auto-Interp
    Negative Logits
     unpre
    -0.76
     Prelude
    -0.69
     apprehension
    -0.66
     bottleneck
    -0.66
     psy
    -0.65
     manifold
    -0.65
     revived
    -0.64
     Metallic
    -0.64
     doubling
    -0.63
     flares
    -0.63
    POSITIVE LOGITS
    isdom
    1.34
    alking
    1.33
    orst
    1.33
    idespread
    1.30
    restling
    1.30
    atson
    1.29
    esley
    1.29
    orthy
    1.25
    izards
    1.25
    olves
    1.25
    Act Density 0.029%

    No Known Activations