INDEX
Explanations
instances where the phrase "go back" or "go" is mentioned in the text
phrases indicating the action of going or returning somewhere
New Auto-Interp
Negative Logits
icons
-0.67
bidden
-0.64
rely
-0.63
Watkins
-0.61
Category
-0.60
igo
-0.59
warrants
-0.57
Arri
-0.57
licenses
-0.57
inguished
-0.56
POSITIVE LOGITS
ahead
0.98
ggle
0.98
buy
0.95
forth
0.94
fuck
0.93
pursue
0.92
verning
0.91
retrieve
0.91
investigate
0.90
explore
0.88
Activations Density 0.067%