Explanations
instances of the word "role" and its context
references to the concept of "role" in various contexts
Negative Logits
Token     Logit
Latest    -0.76
Uri       -0.65
Bulg      -0.64
Bened     -0.64
False     -0.63
Sab       -0.63
mares     -0.62
Hig       -0.62
ãĥ©ãĥ³    -0.62
Lev       -0.61
Positive Logits
Token     Logit
roles     0.93
playing   0.90
role      0.86
role      0.80
reversal  0.79
uty       0.74
ioned     0.73
entials   0.72
ional     0.72
annel     0.71
Activations Density 0.028%