INDEX
Explanations
phrases highlighting positive accomplishments and contributions
Adjectives of positive sentiment after articles
a positive adjective
New Auto-Interp
Negative Logits
때문
-0.52
neither
-0.51
quelconque
-0.49
siquiera
-0.48
<bos>
-0.47
다고
-0.45
вики
-0.44
UpDown
-0.44
ilidad
-0.43
ممکن
-0.43
POSITIVE LOGITS
wonderful
2.07
fantastic
2.06
terrific
1.92
excellent
1.87
amazing
1.80
wonderful
1.79
fantastic
1.77
fabulous
1.76
marvelous
1.75
great
1.73
Activations Density 0.376%