INDEX
    Explanations

    mentions of something happening for the first time

    New Auto-Interp
    Negative Logits
     first
    -2.64
    first
    -2.03
     firſt
    -1.64
     eerste
    -1.45
     kwanza
    -1.42
     firft
    -1.42
     primero
    -1.40
     pertama
    -1.38
     primeira
    -1.37
     første
    -1.37
    POSITIVE LOGITS
    Билгалдахарш
    0.75
    BeginInit
    0.73
     المعيارى
    0.73
    Predecesor
    0.66
     सत्यापित
    0.64
    ArgsConstructor
    0.61
    BorderFactory
    0.60
     franc
    0.60
    Revenir
    0.58
    ArgumentParser
    0.58
    Act Density 1.591%

    No Known Activations