Explanations
phrases indicating leadership or direction
Negative Logits
pleaſure     -0.79
Autoritní    -0.66
faſt         -0.66
purpoſe      -0.65
Rina         -0.63
leçon        -0.62
équi         -0.62
Paulina      -0.61
tagHelper    -0.60
Helga        -0.60
Positive Logits
AddRange     0.67
sosok        0.57
Mou          0.56
ousted       0.54
❋            0.53
IBOutlet     0.52
him          0.50
הערות        0.49
حوالہ        0.48
charismatic  0.48
Activations Density: 0.338%