INDEX
Explanations
phrases indicating assistance, support, or benefits
references to assistance or support
Negative Logits
Token        Logit
theless      -0.75
scanned      -0.73
æ³           -0.67
ata          -0.65
aults        -0.63
TABLE        -0.59
ulations     -0.58
tains        -0.58
avior        -0.57
wear         -0.57
Positive Logits
Token          Logit
alleviate      0.89
lessen         0.81
relieve        0.80
disguise       0.79
fully          0.79
stabilize      0.79
morale         0.78
strengthen     0.78
tremendously   0.77
ease           0.76
Activations Density: 0.065%