INDEX
    Explanations

    proper nouns, specifically names of individuals and locations

    Negative Logits
     ویکی‌پدی          -0.69
     EconPapers        -0.65
    MLLoader           -0.59
    PreferredItem      -0.56
     CreateTagHelper   -0.56
    <tfoot>            -0.50
     typelib           -0.50
    WriteAttribute     -0.49
    pushFollow         -0.49
     FBref             -0.49

    Positive Logits
     Belgien           0.44
     Chemist           0.42
     Nelly             0.41
    ellido             0.41
    Deutschland        0.40
     politician        0.40
     Knights           0.39
    🐖                 0.39
     Soares            0.39
    🐷                 0.39

    Act Density 0.513%

    No Known Activations