INDEX
Explanations
strong verbs or action-oriented words
prominent verbs indicating actions, possibilities, and conditions
Negative Logits
Token        Logit
Templ        -0.70
Deity        -0.65
ordinary     -0.64
inas         -0.63
Practices    -0.62
Div          -0.61
chapter      -0.60
Purchase     -0.59
Wonder       -0.57
Zone         -0.57
Positive Logits
Token        Logit
however      0.80
therefore    0.73
culmin       0.70
consist      0.69
consisted    0.68
varied       0.67
thus         0.64
acronym      0.64
furthermore  0.63
lasted       0.61
Activations Density: 0.581%