INDEX
Explanations
expressions related to gratitude and appreciation
New Auto-Interp
Negative Logits
oine
-0.15
edio
-0.15
igue
-0.15
ventus
-0.15
ouz
-0.14
.rdf
-0.14
Bilim
-0.14
ione
-0.14
odate
-0.14
andbox
-0.14
POSITIVE LOGITS
ola
0.20
Lagos
0.19
Ol
0.19
emi
0.19
Ade
0.18
iola
0.18
erin
0.17
ipe
0.17
ADED
0.17
OLA
0.17
Activations Density 0.071%