INDEX
Explanations
negative sentiments related to experiences, particularly in service and food reviews
New Auto-Interp
Negative Logits
ainfi
-0.85
enfans
-0.72
nôtre
-0.69
vieilles
-0.68
varandra
-0.67
allmän
-0.67
nemlig
-0.65
titolata
-0.65
hunne
-0.64
conseguenza
-0.64
POSITIVE LOGITS
=
0.69
noqa
0.60
+
0.60
plus
0.59
disambiguazione
0.59
autorytatywna
0.59
незавершена
0.58
责任编辑
0.58
no
0.56
OK
0.55
Activations Density 0.627%