INDEX
Explanations
expressions of affection and familial relationships
New Auto-Interp
Negative Logits
toep
-0.58
notoriously
-0.57
sige
-0.54
Off
-0.54
बजाय
-0.54
bang
-0.52
Biographie
-0.52
blz
-0.52
demonios
-0.52
propi
-0.52
POSITIVE LOGITS
🤍
0.75
بوابة
0.75
EndGlobalSection
0.75
Compassion
0.72
❤️
0.72
ьаж
0.71
compassionate
0.71
tenderly
0.68
♥️
0.68
💙
0.67
Activations Density 0.249%