Explanations
negativity
language signaling problems, faults, or negative conditions, often marked by negation or deficiency (e.g., invalidity, lack, inability, "mis-" states, failures, issues, or harms)
words with negative connotations indicating problems, defects, wrongness, or undesirable qualities
Negative Logits
Token           Logit
一事            -0.28
ousy            -0.28
Saras           -0.27
.pub            -0.27
licas           -0.26
路人            -0.25
ertainment      -0.25
Shutterstock    -0.25
stä             -0.24
cased           -0.24
Positive Logits
Token           Logit
时间为          0.29
但由于          0.28
心得            0.27
riel            0.25
Meg             0.25
file            0.25
却没有          0.25
ek              0.25
饶              0.24
腺              0.24
Activations Density 1.623%