INDEX
    Explanations

    phrases related to different types of snacks

    New Auto-Interp
    Negative Logits
     negatives
    -0.72
    idency
    -0.66
     Tsarnaev
    -0.64
    ne
    -0.63
     Masonic
    -0.62
     Angels
    -0.61
     priesthood
    -0.60
    ocal
    -0.59
     militant
    -0.58
    20439
    -0.58
    POSITIVE LOGITS
     snacks
    1.04
     snack
    1.00
    eteria
    0.96
    washer
    0.91
     eaten
    0.89
     foods
    0.88
    cake
    0.87
     eater
    0.87
    iness
    0.87
    oleon
    0.87
    Act Density 0.013%

    No Known Activations