INDEX
    Explanations

    phrases related to causality and outcomes

    phrases indicating outcomes or results that involve "in."

    New Auto-Interp
    Negative Logits
    wine
    -0.63
    dated
    -0.63
     outset
    -0.60
    STAT
    -0.60
    tell
    -0.59
    tenance
    -0.57
    hatt
    -0.56
     nurture
    -0.56
    heit
    -0.56
    deck
    -0.56
    POSITIVE LOGITS
    clusions
    0.76
    effic
    0.75
    illions
    0.74
    escap
    0.72
    efficiency
    0.69
    geoning
    0.66
    ordinate
    0.66
     noticeable
    0.65
     creating
    0.65
    plin
    0.64
    Act Density 0.055%

    No Known Activations