INDEX
Explanations
words related to downward motion or decrease
phrases related to declining trends or decreases
New Auto-Interp
Negative Logits
cius
-0.77
thodox
-0.76
se
-0.73
ledge
-0.71
ific
-0.68
ioch
-0.66
iliary
-0.66
cyclopedia
-0.66
sic
-0.65
ilation
-0.65
POSITIVE LOGITS
BuyableInstoreAndOnline
0.95
Falling
0.84
prey
0.83
asleep
0.79
fall
0.77
owship
0.77
ãĥĥãĥī
0.74
graphene
0.74
falling
0.73
losses
0.69
Activations Density 0.013%