INDEX
Explanations
references to historical and geopolitical entities, specifically related to empires and their territories
New Auto-Interp
Negative Logits
karak
-0.16
Malcolm
-0.15
nest
-0.15
feld
-0.15
Nested
-0.14
nested
-0.14
shower
-0.14
nesting
-0.14
iser
-0.13
_nested
-0.13
POSITIVE LOGITS
å½
0.17
akin
0.17
riz
0.15
oundary
0.15
885
0.15
esz
0.14
oulouse
0.14
Slam
0.14
agli
0.14
ulls
0.14
Activations Density 0.218%