INDEX
Explanations
instances of the word "Another" used to introduce a new piece of information or detail
New Auto-Interp
Negative Logits
bows
-0.90
hips
-0.85
endas
-0.81
folios
-0.79
ouls
-0.79
icides
-0.79
riages
-0.78
anism
-0.78
olas
-0.78
alties
-0.77
POSITIVE LOGITS
worldly
1.06
example
1.03
aspect
1.01
thing
0.98
factor
0.97
notable
0.97
noteworthy
0.96
complication
0.95
important
0.94
reason
0.94
Activations Density 0.054%