Explanations
phrases related to surprising or strange facts or events
phrases that express surprising or unexpected outcomes
Negative Logits
robe                -0.75
irie                -0.74
guiActiveUnfocused  -0.72
enhagen             -0.70
gang                -0.70
ļéĨĴ                -0.66
iami                -0.65
enaries             -0.64
eria                -0.62
ioxide              -0.62
Positive Logits
especially   1.04
huh          1.02
though       1.01
albeit       0.98
considering  0.98
but          0.98
however      0.95
although     0.94
but          0.90
because      0.88
Activations Density 0.288%