INDEX
    Explanations

    references to science and scientific concepts

    New Auto-Interp
    Negative Logits
    :
    -0.63
    yto
    -0.54
    .
    -0.51
     is
    -0.50
    ,
    -0.49
     I
    -0.49
     todo
    -0.49
     in
    -0.47
     par
    -0.46
     col
    -0.45
    POSITIVE LOGITS
     science
    1.22
    science
    1.15
     SCIENCE
    1.07
    WithIOException
    1.07
    Science
    1.07
     Science
    1.04
    cience
    1.02
    SCIENCE
    1.02
    AndEndTag
    1.00
     ]
    
    0.97
    Act Density 0.148%

    No Known Activations