INDEX
Explanations
expressions of pride and affection in a conversation
New Auto-Interp
Negative Logits
którzy
-0.52
rumahnya
-0.49
conseillers
-0.49
("="-0.48
kteří
-0.48
steder
-0.47
culturali
-0.46
berdayakan
-0.46
Mitgliedern
-0.46
inclusief
-0.45
POSITIVE LOGITS
dear
1.05
sweetie
0.99
sweetheart
0.95
GEBURTSDATUM
0.91
buddy
0.90
lad
0.89
dear
0.87
darling
0.86
complexContent
0.84
babe
0.84
Activations Density 0.170%