Explanations
expressions of enjoyment and pleasure
Negative Logits
devenus      -0.59
ñata         -0.56
Maintenant   -0.55
troppo       -0.55
godfather    -0.54
élevées      -0.54
avions       -0.54
Gibbons      -0.54
FormState    -0.54
onCancelled  -0.54
Positive Logits
pleasure     1.78
enjoy        1.52
delight      1.44
pleasure     1.43
Pleasure     1.41
joy          1.35
Enjoy        1.34
enjoyment    1.31
Enjoy        1.30
joy          1.24
Activations Density 0.046%