INDEX
Explanations
proper nouns and entities related to politics and governance
New Auto-Interp
Negative Logits
considérons
-0.53
للاسماء
-0.51
פשוט
-0.48
户
-0.44
diers
-0.44
thritis
-0.43
insee
-0.43
putra
-0.42
Kild
-0.42
dung
-0.42
POSITIVE LOGITS
ad
1.77
ads
1.39
Ad
1.25
ada
1.22
AD
1.17
Ad
1.17
ADA
1.15
ad
1.09
ady
1.06
ade
1.06
Activations Density 2.418%