Explanations
adjectives and descriptors that indicate performance or quality
Negative Logits
ulhu          -0.88
objects       -0.83
artifacts     -0.82
urgently      -0.82
Ingredients   -0.80
SPONSORED     -0.79
ifles         -0.79
upon          -0.77
types         -0.75
iddles        -0.73
Positive Logits
outing        1.24
weekend       1.23
upbringing    1.17
stint         1.15
offseason     1.09
week          1.09
season        1.07
shootout      1.00
month         1.00
summer        0.99
Activations Density: 0.128%