Explanations
phrases related to health and well-being, particularly concerning community and family care
Negative Logits
ấp      -0.15
lick    -0.15
ÑŁ      -0.15
arget   -0.14
olik    -0.14
bles    -0.14
owan    -0.14
daÅŁ    -0.13
edula   -0.13
bable   -0.13
Positive Logits
ance    0.17
indir   0.16
fy      0.15
оки     0.15
future  0.14
ano     0.14
assy    0.14
ona     0.14
łí      0.13
ement   0.13
Activations Density 0.449%