INDEX
    Explanations

    phrases related to giving advice or sharing instructions in casual or gaming contexts

    NEGATIVE LOGITS
    "irm"         -0.74
    "rounder"     -0.69
    "irmed"       -0.62
    "chio"        -0.61
    "thia"        -0.61
    "tty"         -0.61
    "olly"        -0.60
    "ellow"       -0.59
    "anus"        -0.58
    "asts"        -0.57
    POSITIVE LOGITS
    " absurdity"    0.80
    "lessness"      0.78
    " brink"        0.78
    "liest"         0.77
    " where"        0.76
    " extent"       0.75
    " verge"        0.73
    "ophys"         0.71
    " exhaustion"   0.70
    "points"        0.70
    Act Density 0.028%

    No Known Activations