INDEX
Explanations
proper nouns and specific references, particularly related to individuals and institutions
New Auto-Interp
Negative Logits
й
-0.94
Bartol
-0.89
一个
-0.89
manufact
-0.89
__*/
-0.89
Gott
-0.87
IIIIIIII
-0.82
йки
-0.82
Baldwin
-0.81
Sapi
-0.81
POSITIVE LOGITS
soeur
0.95
に
0.93
nationaux
0.92
suivie
0.88
ab
0.85
Lynd
0.84
ly
0.83
ag
0.82
sabbia
0.82
moeite
0.79
Activations Density 2.158%