INDEX
Explanations
words related to abstention or refraining from certain actions
terms and phrases related to abstaining or abstinence
New Auto-Interp
Negative Logits
WM
-0.79
esan
-0.76
quickShipAvailable
-0.75
ãĥį
-0.75
xual
-0.71
Hug
-0.70
DragonMagazine
-0.69
STEM
-0.69
Els
-0.69
ppa
-0.67
POSITIVE LOGITS
abst
0.86
itures
0.84
sb
0.83
atory
0.82
rol
0.81
ention
0.81
ain
0.80
ences
0.79
inent
0.79
iencies
0.79
Activations Density 0.029%