INDEX
Explanations
terms related to movement or progress
instances of the word "go" in various contexts
New Auto-Interp
Negative Logits
lihood
-0.75
lished
-0.73
Advertisement
-0.71
alia
-0.69
stakes
-0.66
Responsibility
-0.64
expression
-0.62
andestine
-0.61
vich
-0.60
majority
-0.59
POSITIVE LOGITS
go
2.97
Go
1.83
go
1.81
Go
1.75
goes
1.61
GO
1.56
proceed
1.42
went
1.38
gone
1.36
stay
1.25
Activations Density 0.045%