INDEX
Explanations
mentions of aid or support, particularly in the context of communities and healthcare
New Auto-Interp
Negative Logits
rik
-0.15
.flink
-0.14
hed
-0.14
>tag
-0.14
交
-0.14
quet
-0.14
mana
-0.14
alus
-0.13
ushima
-0.13
crest
-0.13
POSITIVE LOGITS
fit
0.18
tailor
0.17
account
0.17
Tail
0.17
dedicate
0.16
عداد
0.16
fits
0.16
rises
0.16
rise
0.16
train
0.16
Activations Density 0.009%