INDEX
Explanations
proper nouns and names associated with institutions, foundations, and locations
Negative Logits
art       -0.18
ë°ľ       -0.16
ilan      -0.16
æ´²       -0.15
iana      -0.14
yer       -0.14
nad       -0.14
hind      -0.13
itters    -0.13
Roe       -0.13
Positive Logits
avra      0.15
ACL       0.15
ALSE      0.15
ichick    0.14
eken      0.14
bris      0.13
//!<      0.13
-League   0.13
misunder  0.13
à¥Įन      0.13
Activations Density 0.028%