INDEX
Explanations
phrases indicating instructions or promises to the reader
instances of the verb "be" in various forms and contexts
New Auto-Interp
Negative Logits
arin
-0.70
scrimmage
-0.64
Implementation
-0.63
Stain
-0.62
ottage
-0.61
iasco
-0.61
truce
-0.61
Dialogue
-0.60
aceutical
-0.59
guise
-0.59
POSITIVE LOGITS
able
1.65
amazed
1.27
tempted
1.19
surprised
1.15
glad
1.14
unable
1.05
rewarded
1.05
wondering
1.03
pleasantly
1.02
thankful
1.01
Activations Density 0.122%