INDEX
Explanations
verbs in possessive form
repeated instances of the letter 's'
New Auto-Interp
Negative Logits
EStream
-0.71
uliffe
-0.68
adjunct
-0.65
EStreamFrame
-0.63
Leilan
-0.63
controls
-0.63
displ
-0.60
boycot
-0.60
illac
-0.60
calculating
-0.59
POSITIVE LOGITS
lightly
1.04
atisf
1.00
omew
0.96
ources
0.96
aved
0.93
wered
0.93
ouls
0.92
ond
0.91
uddenly
0.91
ustain
0.90
Activations Density 0.256%