INDEX
Explanations
references to religious places and their significance
Negative Logits
Token        Logit
Yemen        -0.17
Hudson       -0.16
ίλ           -0.15
apollo       -0.15
Cairo        -0.14
ëįĺ          -0.14
HEN          -0.14
Qin          -0.14
Uy           -0.14
.Selection   -0.14
Positive Logits
Token        Logit
Sikh         0.34
Sik          0.30
Guru         0.30
sangat       0.28
Pun          0.28
SG           0.27
lang         0.27
Gur          0.25
à©           0.24
à¨           0.24
Activations Density: 0.069%