INDEX
Explanations
verbs related to avoiding something
phrases related to evading difficult or unwanted situations
New Auto-Interp
Negative Logits
raq
-0.89
uid
-0.83
lease
-0.80
tek
-0.76
STAT
-0.75
soon
-0.73
Roy
-0.71
old
-0.70
nai
-0.69
hered
-0.69
POSITIVE LOGITS
pitfalls
1.37
wasting
1.13
detection
1.12
harming
1.08
mentioning
1.06
collisions
1.06
distractions
1.05
paying
1.05
confrontation
1.05
unnecessary
1.04
Activations Density 0.074%