INDEX
Explanations
entities related to Turkish government officials
specific surnames and proper nouns, especially in the context of political figures or entities
New Auto-Interp
Negative Logits
Ivy
-0.83
Hollywood
-0.78
Hearts
-0.71
Wonderland
-0.65
Harvard
-0.64
Kitty
-0.64
Candy
-0.63
Charisma
-0.63
Crusade
-0.62
Dartmouth
-0.62
POSITIVE LOGITS
istani
1.16
ÄŁ
1.15
ı
1.13
oÄŁan
1.03
lu
0.96
rates
0.95
endar
0.94
uz
0.93
lish
0.90
ugal
0.90
Activations Density 0.010%