INDEX
Explanations
negative sentiments and expressions of doubt or concern
New Auto-Interp
Negative Logits
demais
-0.44
woordig
-0.40
montón
-0.39
Lots
-0.38
Herkunft
-0.37
banget
-0.36
própria
-0.35
staande
-0.35
Lots
-0.35
ellis
-0.34
POSITIVE LOGITS
quite
2.47
quite
2.19
Quite
2.17
Quite
2.14
uite
1.28
nearly
1.16
nearly
1.09
Nearly
1.08
ganz
1.02
Nearly
1.02
Activations Density 0.299%