INDEX
    Explanations

    specific demonstrative pronouns indicating objects or subjects in a discussion

    this/those + noun referring to specific instance

    New Auto-Interp
    Negative Logits
    الدراسه
    -0.74
    zzleHttp
    -0.73
    ロウィン
    -0.71
    fromnode
    -0.64
    pecabe
    -0.64
    majánló
    -0.64
    Obrázky
    -0.62
    iſten
    -0.62
    ɵɵelement
    -0.61
    OGND
    -0.61
    POSITIVE LOGITS
    .
    0.52
    [
    0.42
    This
    0.40
    <
    0.39
    ↵↵
    0.38
    We
    0.38
    "
    0.38
    my
    0.37
    [^
    0.36
    A
    0.36
    Act Density 0.068%

    No Known Activations