INDEX
Explanations
phrases related to speculative or hypothetical scenarios
conditional phrases that imply hypothetical scenarios
Negative Logits
idates     -0.72
throp      -0.67
Towns      -0.62
aires      -0.60
roxy       -0.60
Fraz       -0.59
Airl       -0.58
rebound    -0.57
vendors    -0.55
couples    -0.54
Positive Logits
magically    0.80
Ãł           0.76
rael         0.74
somehow      0.71
invincible   0.65
è£ħ          0.65
superhuman   0.64
paste        0.64
SECTION      0.64
itting       0.63
Activations Density: 0.186%
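
The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of tokens on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only to reproduce a value of the same size:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all; the source page does not define it.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, of which 186 have a nonzero activation.
sample = [1.5] * 186 + [0.0] * 99_814
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.186%
```

Read this way, a density of 0.186% would mean the feature fires on roughly 2 of every 1,000 tokens, which is consistent with a sparse feature tied to a narrow pattern such as hypothetical or conditional phrasing.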