INDEX
Explanations
negative sentiments or criticisms related to projects or developments
New Auto-Interp
Negative Logits
<bos>
-0.59
homogen
-0.49
行
-0.47
شك
-0.46
营
-0.45
Bra
-0.44
bracket
-0.43
trên
-0.43
Hinter
-0.43
řeb
-0.42
POSITIVE LOGITS
__':
1.49
)";
1.31
}}$}
1.29
__":
1.19
}")
1.18
)");
1.16
")));
1.15
!")
1.11
RectangleBorder
1.09
.")
1.08
Activations Density 0.060%