INDEX
Explanations
negative sentiments or critical statements
New Auto-Interp
Negative Logits
latter
-0.27
ÐIJÑĢÑħÑĸвовано
-0.19
页éĿ¢åŃĺæ¡£å¤ĩ份
-0.18
ÐŁÐļ
-0.16
eniable
-0.16
phans
-0.16
جع
-0.15
longleftrightarrow
-0.14
Ø©
-0.14
CreateMap
-0.14
POSITIVE LOGITS
odore
0.24
adays
0.18
_ctxt
0.16
ilig
0.15
etheless
0.15
ris
0.14
же
0.14
atre
0.14
ÑįÑĤомÑĥ
0.14
xiety
0.14
Activations Density 0.200%