INDEX
Explanations
words expressing positivity and admiration
New Auto-Interp
Negative Logits
IsContent
-0.69
gypti
-0.64
almeno
-0.62
Slf
-0.62
allegedly
-0.60
autora
-0.60
AdapterView
-0.57
silencio
-0.55
CacheManager
-0.55
Cassius
-0.55
POSITIVE LOGITS
wonderful
3.04
wonderful
2.76
Wonderful
2.57
Wonderful
2.56
marvelous
2.35
marvellous
2.18
fantastic
2.16
fabulous
2.05
terrific
2.04
wonderfully
2.02
Activations Density 0.052%