INDEX
Explanations
positive adjectives describing something of significant importance or impact
adjectives conveying a sense of significance or impact
New Auto-Interp
Negative Logits
ults
-0.96
ses
-0.86
ravings
-0.85
doms
-0.82
assies
-0.82
bots
-0.80
apses
-0.79
apons
-0.79
Surve
-0.79
units
-0.77
POSITIVE LOGITS
understatement
0.98
sleeper
0.88
antidote
0.87
reminder
0.82
boon
0.81
example
0.79
contender
0.79
spoiler
0.76
fundraiser
0.76
spokesperson
0.76
Activations Density 0.580%