INDEX
    Explanations

    references to numerical values or measurements in scientific contexts

    numeric quantities with units

    New Auto-Interp
    Negative Logits
    OGND
    -0.93
     témoig
    -0.89
    <unused43>
    -0.85
    <unused79>
    -0.85
    <unused41>
    -0.85
    <unused16>
    -0.85
    <unused8>
    -0.85
    <unused14>
    -0.85
    <unused3>
    -0.85
    [@BOS@]
    -0.85
    POSITIVE LOGITS
    0.39
    ↵↵
    0.35
    0.31
     e
    0.31
    Personensuche
    0.30
    0
    0.28
    ...
    0.27
     sarung
    0.26
    E
    0.26
     corrida
    0.25
    Act Density 0.026%

    No Known Activations