Explanations
references to actions starting with the word "opened"
instances of the word "open" and its variations
Negative Logits
Token         Logit
orum          -0.67
rior          -0.66
CTV           -0.65
grade         -0.64
iable         -0.61
stood         -0.60
âĨij           -0.59
ashington     -0.58
Bey           -0.57
constitu      -0.56
Positive Logits
Token         Logit
Doors         0.94
doors         0.92
up            0.87
wounds        0.81
ource         0.77
lique         0.77
UP            0.77
parentheses   0.74
bucks         0.72
portals       0.72
Activations Density: 0.042%
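
One way to read the density figure, sketched under an assumption: it is taken here to mean the fraction of tokens on which the feature's activation is above zero. The page does not define the term, so this reading, the function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition for illustration; the dashboard may compute
    its density differently.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample (not real data): 10,000 tokens, 4 of them active.
sample = [0.0] * 9996 + [1.3, 0.2, 2.7, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density as low as the one reported would indicate a sparse feature that fires on only a handful of tokens per ten thousand.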