INDEX
Explanations
negative sentiments or implications
New Auto-Interp
Negative Logits
دانشنامهٔ
-0.82
itect
-0.70
\}.
-0.68
sizePolicy
-0.67
Hickey
-0.67
Hic
-0.66
ンドウ
-0.66
SDAY
-0.65
Rij
-0.64
bürger
-0.64
POSITIVE LOGITS
-
2.01
{-1.46
">-
1.45
-"
1.44
-}
1.44
>-</
1.40
'-
1.33
}-
1.31
-\
1.30
&-
1.29
Activations Density 0.479%