INDEX
Explanations
mentions of locations related to communities and medical facilities
New Auto-Interp
Negative Logits
ADDE
-0.19
oba
-0.14
ugo
-0.14
AGO
-0.14
usercontent
-0.14
PLE
-0.14
mainland
-0.14
Ý
-0.14
thon
-0.13
uum
-0.13
POSITIVE LOGITS
Kir
0.26
Modi
0.25
Beer
0.24
Bet
0.24
Ein
0.23
Ram
0.22
Net
0.22
Lod
0.22
Nah
0.21
Ein
0.20
Activations Density 0.031%