INDEX
Explanations
modifiers indicating a high degree of an attribute or action
intensifiers used to emphasize qualities or characteristics
New Auto-Interp
Negative Logits
enhagen
-0.75
iem
-0.71
Travels
-0.69
Pavilion
-0.68
RY
-0.66
DAQ
-0.65
Frames
-0.65
antry
-0.65
éŃĶ
-0.64
esi
-0.63
POSITIVE LOGITS
bad
0.97
bad
0.95
BAD
0.92
nasty
0.88
cool
0.88
GOOD
0.88
naughty
0.87
juicy
0.86
shitty
0.85
nice
0.84
Activations Density 0.108%