INDEX
    Explanations

    the word "role" in different contexts

    New Auto-Interp
    Negative Logits
    glass
    -0.69
    BUG
    -0.67
    locks
    -0.63
    fol
    -0.63
    iel
    -0.62
    Lock
    -0.61
    itsu
    -0.60
    Apply
    -0.60
    ouf
    -0.60
    sticks
    -0.60
    POSITIVE LOGITS
     shaping
    1.31
     determining
    1.13
    ordinate
    1.04
     influencing
    1.03
     facilitating
    1.02
     perpet
    0.98
     helping
    0.92
     ensuring
    0.91
     regulating
    0.90
    clus
    0.89
    Act Density 0.120%

    No Known Activations