Explanations
terms related to hygiene and cleanliness practices
Negative Logits
Token       Logit
ÙĪÙĨا       -0.15
kim         -0.15
ONO         -0.14
oric        -0.14
xygen       -0.14
dent        -0.14
-License    -0.14
gross       -0.14
rej         -0.14
lm          -0.14
Positive Logits
Token       Logit
å±Ģ         0.14
bedo        0.14
opts        0.14
ephy        0.14
Bark        0.13
thest       0.13
ีà¸Ńย       0.13
vier        0.13
bery        0.13
Surf        0.13
Activations Density: 0.011%