INDEX
Explanations
instances of punctuation or textual markers indicating quotes or citations
Negative Logits
]]=                  -0.15
ì´                   -0.14
CHANT                -0.14
оба                  -0.14
erea                 -0.14
orra                 -0.13
大人                 -0.13
interface            -0.13
.githubusercontent   -0.13
oro                  -0.13
Positive Logits
foot                 0.24
Foot                 0.21
foot                 0.20
footnote             0.19
_foot                0.18
fn                   0.17
Foot                 0.16
FOOT                 0.16
-foot                0.16
.ref                 0.16
Activations Density: 0.024%