INDEX
Explanations
phrases indicating consequences or reactions related to policy decisions
Follows a conjunction/adverb
skepticism and criticism
New Auto-Interp
Negative Logits
wonderful
-0.59
nahilalakip
-0.55
meravigli
-0.54
merveille
-0.54
skall
-0.53
wonderful
-0.52
correcte
-0.52
مشين
-0.50
mentira
-0.50
الأصل
-0.48
POSITIVE LOGITS
critics
0.82
مرئيه
0.79
Critics
0.76
Critics
0.70
lenker
0.69
tensions
0.69
Analysts
0.69
skepticism
0.67
skep
0.67
amid
0.67
Activations Density 0.287%