Explanations
mentions of the word "yoga"
terms related to yoga and its culture
Negative Logits
etary        -0.88
hyde         -0.88
士           -0.82
ilial        -0.75
olver        -0.73
ppelin       -0.73
ly           -0.70
itiveness    -0.69
itions       -0.67
lies         -0.67
Positive Logits
pants        0.82
pants        0.76
Yog          0.72
Pants        0.70
walk         0.69
Yoga         0.69
yog          0.68
flats        0.67
instructor   0.65
studios      0.64
Activations Density: 0.093%