Explanations
genres and categories related to health, wellness, and personal development
Negative Logits
afil        -0.15
گوی         -0.15
قم          -0.14
ρυ          -0.14
속          -0.14
gom         -0.13
Blasio      -0.13
å¤          -0.13
appropri    -0.13
OPTION      -0.13
Positive Logits
Reference   0.25
reference   0.23
Reference   0.23
reference   0.21
spacer      0.20
/reference  0.19
Barg        0.19
-reference  0.17
参考        0.17
_reference  0.17
Activations Density 0.009%