INDEX
Explanations
phrases related to expressing preferences or opinions
expressions of concern and emotional support within interpersonal relationships
New Auto-Interp
Negative Logits
etheless
-0.93
ãĤ´ãĥ³
-0.87
¥ŀ
-0.84
ortium
-0.81
*:
-0.78
Indeed
-0.75
âĦ¢:
-0.74
surprisingly
-0.73
UGC
-0.72
éŃĶ
-0.72
POSITIVE LOGITS
,'"
1.40
â̦"
1.35
.")
1.28
.'"
1.28
..."
1.23
?'"
1.22
',"
1.20
!'"
1.14
),"
1.14
,"
1.13
Activations Density 0.917%