INDEX
Explanations
phrases related to dependency or reliance
phrases indicating reliance or dependence on specific factors or conditions
New Auto-Interp
Negative Logits
downed
-0.71
playable
-0.62
punishable
-0.62
showc
-0.61
confir
-0.60
repeated
-0.60
phased
-0.58
soar
-0.58
risen
-0.58
inspected
-0.57
POSITIVE LOGITS
onom
0.74
upon
0.73
sylv
0.71
wards
0.70
ateral
0.66
on
0.66
Towards
0.64
onne
0.64
ocus
0.64
on
0.63
Activations Density 0.106%