INDEX
Explanations
adjectives describing positive qualities or states
instances of the word "good."
New Auto-Interp
Negative Logits
Span
-0.66
span
-0.63
offshore
-0.61
mount
-0.59
ument
-0.58
accus
-0.58
blaze
-0.58
spans
-0.58
elevation
-0.57
tens
-0.57
POSITIVE LOGITS
good
3.92
Good
2.14
GOOD
1.85
Good
1.68
bad
1.64
better
1.59
good
1.58
nice
1.40
great
1.27
Excellent
1.25
Activations Density 0.006%