INDEX
    Explanations

    proper nouns, particularly names and locations related to historical events

    New Auto-Interp
    Negative Logits
    טו
    -0.52
    eraard
    -0.48
    upra
    -0.47
    此处
    -0.46
     Hanley
    -0.45
     corret
    -0.44
     Dempsey
    -0.44
    Memoria
    -0.42
    GRE
    -0.42
     دستی
    -0.42
    POSITIVE LOGITS
     تضيفلها
    1.02
    AddTagHelper
    0.85
    DoubleQuotes
    0.81
    VersionUID
    0.80
    
    0.75
    aarrggbb
    0.75
     мәкал
    0.75
    complexContent
    0.74
    πισ
    0.70
     виправивши
    0.69
    Act Density 2.963%

    No Known Activations