INDEX
Explanations
adjectives and phrases related to characteristics and behaviors, such as optimism, elegance, and relentlessness
New Auto-Interp
Negative Logits
slave
-0.79
ainer
-0.77
Ĥİ
-0.77
ploma
-0.74
orah
-0.74
OIL
-0.73
uther
-0.72
udder
-0.72
avers
-0.71
ittee
-0.70
POSITIVE LOGITS
ness
1.33
ly
1.23
nesses
1.11
nature
1.05
sounding
0.99
ones
0.95
minded
0.92
glers
0.91
NESS
0.91
souls
0.91
Activations Density 3.889%