INDEX
Explanations
phrases related to government actions and policies
New Auto-Interp
Negative Logits
ãĤ´ãĥ³
-0.69
©¶æ¥µ
-0.57
?).
-0.56
respectively
-0.55
.).
-0.51
):
-0.47
Reviewed
-0.46
é¾
-0.45
âĵĺ
-0.44
$.
-0.44
POSITIVE LOGITS
..."
1.31
â̦"
1.26
%"
1.25
,"
1.17
.")
1.16
),"
1.14
)"
1.11
[
1.10
,'"
1.08
)",
1.08
Activations Density 1.408%