Explanations
actions related to offering or providing help
Negative Logits
theless    -0.79
iu         -0.72
meat       -0.69
rencies    -0.62
posure     -0.61
ata        -0.61
é¾         -0.59
Pict       -0.59
amba       -0.58
Hebdo      -0.58
Positive Logits
alleviate       1.00
facilitate      0.97
stabilize       0.89
improve         0.88
organize        0.87
fully           0.87
relieve         0.85
propel          0.85
solve           0.85
tremendously    0.84
Activations Density: 1.245%