INDEX
Explanations
words related to escaping or fleeing
mentions of the word "escape" in various contexts
New Auto-Interp
Negative Logits
sonian
-0.84
inki
-0.73
ammy
-0.71
trust
-0.71
eous
-0.69
iop
-0.67
older
-0.67
urally
-0.66
alky
-0.66
aic
-0.66
POSITIVE LOGITS
escape
1.03
escapes
0.93
escaped
0.89
Torment
0.82
escape
0.82
escaping
0.78
hatch
0.76
detection
0.74
APE
0.74
convict
0.71
Activations Density 0.009%