INDEX
Explanations
intensifying or amplifying adverbs
Negative Logits
Token          Logit
IsContent      -0.70
Jackman        -0.70
sauvages       -0.67
trecut         -0.66
ActionTypes    -0.62
toscana        -0.61
ded            -0.61
Gillette       -0.60
segü           -0.60
tanga          -0.60
Positive Logits
Token          Logit
very           1.02
Very           1.01
VERY           0.98
VERY           0.95
Very           0.93
very           0.93
Molto          0.84
Muy            0.79
sehr           0.79
umato          0.75
Activations Density 0.041%
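
As a rough illustration of what a density figure like the one above could mean, the sketch below treats activation density as the fraction of token positions on which the feature fires. This definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; the page itself does not state how the value is computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumed definition for illustration only; the dashboard does not
    document its own formula.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 4 of which activate the feature.
acts = [0.0] * 9996 + [0.8, 1.3, 0.2, 2.1]
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.041% would correspond to the feature firing on roughly 4 out of every 10,000 tokens.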