INDEX
    Explanations

    numbers and symbols related to references or lists

    Punctuation followed by a capitalized word

    New Auto-Interp
    Negative Logits
     contextLoads
    -0.61
     समीक्षाओं
    -0.60
     Normdatei
    -0.57
     Factbook
    -0.53
     المعيارى
    -0.52
     创建时间
    -0.51
    ">//
    -0.51
     Baillargeon
    -0.51
    PhysRevD
    -0.50
    }}$\\
    -0.49
    POSITIVE LOGITS
    And
    0.75
    It
    0.70
    As
    0.67
    AND
    0.67
    In
    0.67
    To
    0.66
    They
    0.65
    This
    0.64
    A
    0.64
    For
    0.64
    Act Density 1.240%

    No Known Activations