INDEX
Explanations
words related to important functions or responsibilities
references to the concept of "role" in various contexts
New Auto-Interp
Negative Logits
Sab
-0.71
£ı
-0.69
False
-0.69
Latest
-0.66
Rapt
-0.65
agle
-0.64
Hig
-0.63
Ign
-0.62
Pic
-0.61
awk
-0.60
POSITIVE LOGITS
roles
1.06
role
0.95
role
0.88
reversal
0.87
ioned
0.80
incent
0.80
playing
0.77
model
0.76
models
0.73
inic
0.72
Activations Density 0.026%