INDEX
    Explanations

    repeated proper nouns and specific names

    Two-letter abbreviations

    proper names and technical terms

    New Auto-Interp
    Negative Logits
    GEBURTSDATUM
    -0.93
    はじめに
    -0.74
    :✨
    -0.74
     بيها
    -0.62
    +][
    -0.61
    EndGlobalSection
    -0.59
     kasarigan
    -0.58
    ientôt
    -0.58
    rungsseite
    -0.58
     snippetHide
    -0.57
    POSITIVE LOGITS
     itself
    0.60
     proprement
    0.60
     itſelf
    0.59
    0.58
     stessa
    0.58
     protoimpl
    0.55
    '
    0.54
     InputDecoration
    0.53
     himſelf
    0.53
     stesso
    0.52
    Act Density 0.883%

    No Known Activations