INDEX
Explanations
sentences discussing states of being or activities
emotional expressions and responses to experiences
Negative Logits
habi         -0.78
ctuary       -0.75
hovah        -0.67
osponsors    -0.67
tains        -0.66
ocumented    -0.64
itutes       -0.61
scribe       -0.60
ividual      -0.59
ocument      -0.57
Positive Logits
hadn         0.70
Was          0.68
calmed       0.64
inval        0.63
hurried      0.62
rushed       0.61
didn         0.61
went         0.60
Turns        0.60
felt         0.60
Activations Density 1.598%