INDEX
Explanations
phrases indicating presence, assistance, or declaration
phrases indicating purpose or intention
Negative Logits
reliance          -0.74
attributed        -0.69
resorted          -0.68
reliant           -0.67
fielded           -0.62
ancest            -0.62
synchronization   -0.62
Classification    -0.60
attributable      -0.60
exposures         -0.59
Positive Logits
brate             0.92
celebrate         0.83
stay              0.83
wark              0.80
orate             0.79
ivo               0.77
othe              0.77
uphold            0.76
attery            0.76
swer              0.75
Activations Density: 0.119%