INDEX
Explanations
quantifiers suggesting high intensity or importance
phrases indicating high-quality or desirable attributes related to products or experiences
New Auto-Interp
Negative Logits
anism
-0.72
ews
-0.70
uploads
-0.69
\<
-0.68
atell
-0.65
ipedia
-0.64
AIDS
-0.64
bis
-0.64
Corp
-0.61
due
-0.61
POSITIVE LOGITS
fantastic
1.29
terrific
1.25
definite
1.24
wonderful
1.22
great
1.21
nice
1.18
delightful
1.16
lovely
1.15
perfect
1.13
wonderfully
1.12
Activations Density 0.237%