INDEX
    Explanations

    words related to important functions or responsibilities

    references to the concept of "role" in various contexts

    Negative Logits
    Token      Logit
    "Sab"      -0.71
    "£ı"       -0.69
    "False"    -0.69
    "Latest"   -0.66
    " Rapt"    -0.65
    "agle"     -0.64
    "Hig"      -0.63
    "Ign"      -0.62
    "Pic"      -0.61
    "awk"      -0.60
    Positive Logits
    Token          Logit
    " roles"       1.06
    " role"        0.95
    "role"         0.88
    " reversal"    0.87
    "ioned"        0.80
    " incent"      0.80
    "playing"      0.77
    " model"       0.76
    " models"      0.73
    "inic"         0.72
    Activation Density: 0.026%

    No Known Activations