INDEX
Explanations
negative sentiments and expressions of disillusionment
Questions, qualifiers, and subsequent words expressing doubt or insignificance
not important
New Auto-Interp
Negative Logits
енча
-0.61
SPATH
-0.56
المناصب
-0.56
rrggbb
-0.52
mixt
-0.52
Paglinawan
-0.48
Möglich
-0.48
estors
-0.48
BoxFit
-0.47
complémentaires
-0.47
POSITIVE LOGITS
meaningless
1.16
insignificant
1.13
unimportant
1.11
irrelevant
1.05
hardly
1.01
pointless
0.96
useless
0.96
negligible
0.95
Hardly
0.93
worthless
0.91
Activations Density 0.575%