INDEX
Explanations
phrases related to leaving or escaping
phrases indicating a desire or need to escape or leave a situation
Negative Logits
Token        Logit
heartbeat    -0.70
Emin         -0.62
Brach        -0.59
Hera         -0.58
guyen        -0.54
gallery      -0.52
Huang        -0.51
slideshow    -0.51
hemisphere   -0.50
waning       -0.49
Positive Logits
Token        Logit
ta           1.15
alive        0.84
doors        0.83
bid          0.80
fitted       0.78
done         0.78
smart        0.77
stretched    0.76
played       0.74
last         0.72
Activations Density: 0.057%