INDEX
Negative Logits
RESPONSE
-0.09
outfits
-0.08
Outfit
-0.08
ిస్
-0.08
Dish
-0.08
alité
-0.07
indulge
-0.07
mastermind
-0.07
Interestingly
-0.07
Interesting
-0.07
POSITIVE LOGITS
Grant
0.08
unan
0.08
745
0.08
忧
0.07
tas
0.07
030
0.07
cytok
0.07
Grant
0.07
115
0.07
variability
0.07
Activations Density 0.001%