INDEX
Explanations
references to user-generated content and related user input
New Auto-Interp
Negative Logits
ulhu
-0.71
Slim
-0.71
knots
-0.67
Ital
-0.65
Jackets
-0.64
Reeves
-0.64
buck
-0.64
beads
-0.62
limp
-0.62
sunset
-0.61
POSITIVE LOGITS
generated
1.23
friendly
1.19
driven
1.14
oriented
1.09
favorite
1.09
centric
1.07
controlled
1.05
facing
1.05
centered
1.02
focused
1.01
Activations Density 0.061%