INDEX
Explanations
expressions of love and affection
New Auto-Interp
Negative Logits
isInitialized
-0.59
minator
-0.55
NUKAT
-0.55
reaper
-0.54
transférez
-0.53
("-");-0.52
kasarigan
-0.52
propOrder
-0.51
său
-0.51
énario
-0.50
POSITIVE LOGITS
love
3.41
loved
3.20
loves
3.08
LOVE
3.00
love
2.97
Love
2.90
Love
2.83
loving
2.82
LOVE
2.79
loved
2.79
Activations Density 0.045%