INDEX
Explanations
proper nouns, particularly names of people and organizations
New Auto-Interp
Negative Logits
оÑī
-0.17
Tone
-0.15
OperationException
-0.15
Wars
-0.15
Adv
-0.15
emm
-0.15
Rencontres
-0.14
inals
-0.14
abeth
-0.14
ears
-0.14
POSITIVE LOGITS
à¹Ģà¸ķà¸Ńร
0.17
asse
0.15
Lafayette
0.15
assel
0.15
bris
0.15
brit
0.14
šť
0.14
รà¸ĵ
0.14
lv
0.14
óż
0.14
Activations Density 0.051%