INDEX
Explanations
various references to properties and methods in programming contexts, particularly related to session management and data handling
Appears before words in other languages
multilingual instruction following
Negative Logits
Theſe        -1.59
myſelf       -1.58
Efq          -1.55
Monfieur     -1.52
itſelf       -1.47
Shakspeare   -1.44
Jefus        -1.40
raiſ         -1.40
ſeveral      -1.40
fubject      -1.37
Positive Logits
↵            0.56
<eos>        0.54
(blank)      0.47
↵↵           0.45
</td>        0.43
<unused63>   0.42
↵↵↵          0.42
</h3>        0.42
<unused61>   0.41
(blank)      0.41
Activations Density: 0.055%