INDEX
    Explanations

    recipes or instructions

    words related to appreciation and recognition

    New Auto-Interp
    Negative Logits
     haz
    -0.73
     Bermuda
    -0.68
     Catalyst
    -0.68
     boarding
    -0.66
     Belt
    -0.66
     Hallow
    -0.65
     Butt
    -0.65
     Canary
    -0.63
     purse
    -0.63
     Span
    -0.63
    POSITIVE LOGITS
    reci
    1.42
    ocity
    1.10
    ating
    0.98
    apy
    0.97
    issance
    0.96
    ated
    0.96
    bled
    0.93
    ate
    0.93
    ased
    0.92
    asing
    0.90
    Act Density 0.015%

    No Known Activations