Explanations
Latin letters C with a cedilla below
instances of questioning or requesting action
Negative Logits
Token    Logit
``       -1.17
`.       -1.03
``       -1.02
`,       -0.96
Âł       -0.96
`        -0.86
Âł       -0.77
````     -0.76
ÃŃs      -0.73
?".      -0.73
Positive Logits
Token    Logit
—        2.18
ÂŃ       1.35
âĢķ      1.25
ÂŃ       1.00
pic      0.99
»        0.91
——       0.88
â̦       0.85
(@       0.85
.—       0.85
Activations Density 0.232%
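
The density figure above is usually read as the fraction of sampled token positions on which the feature fires. The sketch below shows that calculation under stated assumptions: the function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical, and the dashboard's exact definition (for example, its firing threshold) is not given in the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a feature counts as "active" when its activation is strictly
    greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 token positions, 2 of which activate the feature.
sample = [0.0] * 998 + [1.7, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density well below 1%, like the one reported here, indicates a sparse feature that fires on only a small share of tokens.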