INDEX
Explanations
phrases related to bouncing back or returning to a previous state or position
New Auto-Interp
Negative Logits
caut
-0.63
ision
-0.62
Newly
-0.58
NEWS
-0.58
notoriously
-0.57
cowork
-0.57
orst
-0.56
understatement
-0.56
inexper
-0.56
weather
-0.56
POSITIVE LOGITS
fires
1.09
fired
1.02
packs
1.00
dated
0.98
doors
0.91
tracking
0.90
home
0.82
spin
0.81
home
0.80
wards
0.80
Activations Density 0.035%