INDEX
Explanations
superlatives or comparisons of degree, such as "most" or "more."
New Auto-Interp
Negative Logits
æ©
-0.90
rompt
-0.88
heid
-0.83
pload
-0.82
Films
-0.79
oak
-0.78
instead
-0.76
eto
-0.74
undle
-0.74
ategories
-0.74
POSITIVE LOGITS
important
1.29
powerful
1.15
obvious
1.13
likely
1.12
prominent
1.10
interesting
1.10
plausible
1.10
likely
1.09
efficient
1.09
profitable
1.08
Activations Density 10.217%