INDEX
Explanations
references to social issues and economic factors impacting society
Negative Logits
ç¤            -0.15
Fet           -0.15
ucs           -0.15
outright      -0.15
because       -0.15
ziel          -0.14
oltip         -0.14
ÙħÙĤابÙĦ      -0.14
porque        -0.14
ãĤħ           -0.14
Positive Logits
availability  0.15
èĢĮ           0.15
adt           0.15
Availability  0.14
illi          0.14
Alone         0.14
lund          0.14
chor          0.14
Vernon        0.14
okol          0.14
Activations Density: 0.352%