INDEX
Explanations
comparative phrases and conjunctions that indicate contrast or distinction
New Auto-Interp
Negative Logits
obot
-0.17
ä¸ĬäºĨ
-0.14
uri
-0.14
uml
-0.13
सन
-0.13
ran
-0.13
одав
-0.13
ạy
-0.13
à¤Ńव
-0.13
/socket
-0.13
POSITIVE LOGITS
others
0.18
others
0.18
ao
0.17
bagi
0.17
whereas
0.17
Whereas
0.15
ysi
0.15
juana
0.15
iya
0.15
335
0.14
Activations Density 0.138%