INDEX
Explanations
positive sentiments and expressions of satisfaction in reviews
Negative Logits
_DECREF    -0.15
NES        -0.14
ertest     -0.14
dle        -0.14
dll        -0.14
Moses      -0.14
iens       -0.14
еств       -0.13
Nickel     -0.13
рабат      -0.13
Positive Logits
hev          0.18
urtle        0.17
Afr          0.15
ird          0.15
уру          0.15
ya           0.14
compact      0.14
compliments  0.14
itsu         0.14
obb          0.14
Activations Density 0.047%