INDEX
Explanations
proper nouns associated with locations and names
New Auto-Interp
Negative Logits
alfa
-0.16
Peters
-0.15
rega
-0.14
yoksa
-0.14
isku
-0.14
orest
-0.14
ÑĢÑĸв
-0.14
ict
-0.13
rijk
-0.13
misc
-0.13
POSITIVE LOGITS
brothers
0.21
Brothers
0.21
ville
0.20
Family
0.19
æ°ı
0.18
family
0.18
sisters
0.17
stown
0.17
ì͍
0.17
sville
0.16
Activations Density 0.256%