INDEX
Explanations
phrases that indicate factors influencing decision-making processes
New Auto-Interp
Negative Logits
CanadaChoose
-0.41
上
-0.34
内外
-0.33
oredCriteria
-0.32
APON
-0.32
amid
-0.32
RefNanny
-0.32
nữa
-0.32
favorite
-0.32
竟
-0.32
POSITIVE LOGITS
whereas
2.92
whereas
2.75
Whereas
2.59
Whereas
2.56
sedangkan
1.98
natomiast
1.88
conversely
1.84
tandis
1.82
Conversely
1.80
hingegen
1.77
Activations Density 0.880%