INDEX
Explanations
terms related to health, personal well-being, and societal issues affecting individuals
New Auto-Interp
Negative Logits
bourg
-0.08
llib
-0.08
oltip
-0.08
æ¤
-0.08
šet
-0.08
urtle
-0.07
ÅĻe
-0.07
eru
-0.07
sein
-0.07
hea
-0.07
POSITIVE LOGITS
personal
0.08
vas
0.07
religious
0.06
personal
0.06
intimate
0.06
connected
0.06
fine
0.06
sensitive
0.06
ãĥ¼ãĤ¹
0.06
bi
0.05
Activations Density 0.000%