INDEX
Explanations
French words, especially those related to organizations and locations
proper nouns and specific terms related to organizations and locations
New Auto-Interp
Negative Logits
KY
-0.77
MSN
-0.76
Ky
-0.74
urities
-0.70
tumblr
-0.69
staking
-0.69
schild
-0.69
Introduced
-0.68
terms
-0.67
visible
-0.66
POSITIVE LOGITS
itaire
0.92
faire
0.61
mathemat
0.60
fatig
0.58
Ninth
0.58
Portug
0.58
nephew
0.57
Resort
0.56
Latino
0.55
facult
0.55
Activations Density 0.213%