INDEX
    Explanations

    various references to properties and methods in programming contexts, particularly related to session management and data handling

    Appears before words in other languages

    multilingual instruction following

    New Auto-Interp
    Negative Logits
     Theſe
    -1.59
     myſelf
    -1.58
     Efq
    -1.55
     Monfieur
    -1.52
     itſelf
    -1.47
     Shakspeare
    -1.44
     Jefus
    -1.40
     raiſ
    -1.40
     ſeveral
    -1.40
     fubject
    -1.37
    POSITIVE LOGITS
    0.56
    <eos>
    0.54
      
    0.47
    ↵↵
    0.45
    </td>
    0.43
    <unused63>
    0.42
    ↵↵↵
    0.42
    </h3>
    0.42
    <unused61>
    0.41
       
    0.41
    Act Density 0.055%

    No Known Activations