INDEX
Explanations
mentions of Egypt and its related terms
New Auto-Interp
Negative Logits
kke
-0.17
lify
-0.17
leck
-0.16
vig
-0.16
agen
-0.15
ogn
-0.15
illac
-0.14
GAN
-0.14
esk
-0.14
skill
-0.14
POSITIVE LOGITS
Nile
0.24
Egyptian
0.23
Egypt
0.22
ors
0.21
Egyptians
0.21
Egypt
0.20
ology
0.19
Cairo
0.19
ual
0.18
uez
0.18
Activations Density 0.014%