Explanations
ideas related to health infrastructure and community living conditions
Negative Logits
erras       -0.16
高速        -0.15
ensem       -0.15
neutral     -0.14
Hurt        -0.14
imei        -0.14
voks        -0.14
,'#         -0.14
uzzi        -0.14
Reducers    -0.14
Positive Logits
fil         0.34
rats        0.31
filthy      0.30
fil         0.30
conditions  0.28
filt        0.28
rat         0.27
Fil         0.26
Fil         0.25
filt        0.25
Activations Density 0.169%