INDEX
Explanations
specific mentions of actions or events that have "been" done
instances of the word "been" in various contexts
New Auto-Interp
Negative Logits
âĦ¢:
-0.71
Relief
-0.71
Awakens
-0.71
Must
-0.69
terday
-0.67
lies
-0.67
arta
-0.67
Wid
-0.66
ives
-0.66
odder
-0.66
POSITIVE LOGITS
replaced
0.99
subjected
0.98
likened
0.97
taken
0.96
shown
0.90
deemed
0.90
upgraded
0.89
given
0.89
omitted
0.89
relegated
0.88
Activations Density 0.160%