INDEX
    Explanations

    principles and concepts related to theoretical frameworks in scientific contexts

    New Auto-Interp
    Negative Logits
    lear
    -0.14
    akk
    -0.14
    ogy
    -0.14
    optera
    -0.14
    ynes
    -0.14
     conversation
    -0.14
     Functions
    -0.14
    uple
    -0.14
    stal
    -0.14
     wz
    -0.14
    POSITIVE LOGITS
     physically
    0.25
     physical
    0.24
     quantity
    0.21
    physical
    0.21
     property
    0.21
     Physical
    0.20
     simpl
    0.20
     quantities
    0.20
    quantity
    0.19
    Physical
    0.19
    Act Density 0.140%

    No Known Activations