INDEX
Explanations
mentions of items or objects being described as "fancy"
references to the word "fancy."
New Auto-Interp
Negative Logits
etts
-0.78
upon
-0.77
sen
-0.75
scl
-0.72
essee
-0.70
arenthood
-0.68
iland
-0.67
————————————————
-0.66
IRO
-0.66
amaru
-0.65
POSITIVE LOGITS
fancy
1.23
pants
0.95
fanc
0.77
notions
0.74
tail
0.73
fries
0.71
nifty
0.70
dress
0.70
rous
0.70
vier
0.68
Activations Density 0.014%