INDEX
    Negative Logits
    Token          Logit
    "potion"       -0.71
    "CHAT"         -0.68
    "morning"      -0.66
    "Recommend"    -0.64
    "selection"    -0.64
    " Kara"        -0.63
    " pier"        -0.62
    " Factor"      -0.61
    "Channel"      -0.61
    "Bay"          -0.61
    Positive Logits
    Token             Logit
    "hips"            1.15
    "chool"           1.06
    "hip"             1.01
    "agascar"         0.87
    "mith"            0.80
    "ystem"           0.80
    " governments"    0.80
    " collide"        0.80
    " empires"        0.78
    " alike"          0.78
    Activation Density: 0.184%

    No Known Activations