INDEX
Explanations
sentiments indicating hope and opportunity
New Auto-Interp
Negative Logits
='".
-0.15
лиÑĪ
-0.14
rics
-0.14
aise
-0.14
raphics
-0.14
kel
-0.14
surrounds
-0.14
enso
-0.13
pers
-0.13
ัà¸ģร
-0.13
POSITIVE LOGITS
other
0.28
another
0.28
Other
0.25
Other
0.24
Another
0.24
Another
0.23
another
0.22
other
0.22
Similarly
0.21
andere
0.20
Activations Density 0.170%