INDEX
    Explanations

    phrases related to embracing or accepting something

    New Auto-Interp
    Negative Logits
    <bos>
    -0.68
    SizeMode
    -0.57
    otheby
    -0.57
    .*")]
    -0.55
    WriteHeader
    -0.54
    ('.'
    -0.54
    TextSpan
    -0.52
    ManyToOne
    -0.52
    ("."
    -0.50
    ]^{-
    -0.50
    POSITIVE LOGITS
     embra
    1.27
     strick
    1.20
     depic
    1.19
     snoopy
    1.14
     hentai
    1.14
     ftu
    1.13
     fta
    1.13
     thut
    1.11
     apprehen
    1.10
     reluct
    1.09
    Act Density 0.076%

    No Known Activations