INDEX
Explanations
mentions of roles and responsibilities in various contexts
Negative Logits
Token      Logit
apon       -0.18
iglia      -0.18
rael       -0.17
don        -0.16
rale       -0.16
APON       -0.16
mitt       -0.15
Mov        -0.15
rit        -0.14
fold       -0.14
Positive Logits
Token      Logit
-playing   0.23
playing    0.23
(Role      0.21
played     0.20
(role      0.19
.Role      0.19
played     0.19
ROLE       0.18
Role       0.17
Played     0.17
Activations Density 0.030%