INDEX
Explanations
phrases related to avoiding consequences or breaking rules
instances of the phrase "get away" with various numerical values indicating frequency and context
Negative Logits
Yamato          -0.63
Merrill         -0.61
urgency         -0.59
Interstitial    -0.59
Bulg            -0.57
succession      -0.55
Takeru          -0.55
Zeit            -0.54
Laksh           -0.53
GMT             -0.52
Positive Logits
safely          0.73
oned            0.68
unsc            0.64
uced            0.64
pun             0.62
door            0.61
ped             0.60
yp              0.59
=~=~            0.59
ixed            0.58
Activations Density: 0.047%