Explanations
geographical locations and names
Negative Logits
Token    Logit
R        -0.15
emen     -0.15
ipp      -0.15
hle      -0.14
Yus      -0.14
leta     -0.14
elf      -0.14
ecs      -0.14
Maj      -0.14
echan    -0.14
Positive Logits
Token    Logit
çĶŁ      0.19
born     0.18
Born     0.16
Born     0.16
ONA      0.15
üstü     0.15
raised   0.15
born     0.15
grew     0.15
indo     0.15
Activations Density: 0.057%