INDEX
Explanations
phrases indicating willingness or ability to take action
instances of the verb "to be" in various forms and contexts
Negative Logits
izable        -0.76
Acquisition   -0.67
attempt       -0.65
Manufact      -0.63
fade          -0.63
fray          -0.60
Must          -0.59
reperto       -0.58
Wid           -0.57
circulation   -0.57
POSITIVE LOGITS
bothered      1.14
persuaded     0.99
sure          0.96
swayed        0.96
trusted       0.95
forgiven      0.94
confident     0.91
fooled        0.90
tempted       0.88
assured       0.86
Activations Density: 0.076%