INDEX
    Explanations

    repeated references to the word "the"

    New Auto-Interp
    Negative Logits
    ensem
    -0.16
     inflamm
    -0.15
    ahoma
    -0.15
    .AppSettings
    -0.14
    rzy
    -0.14
    raÄį
    -0.14
    ıf
    -0.14
     صÙĨع
    -0.14
    soles
    -0.13
    asant
    -0.13
    POSITIVE LOGITS
     standpoint
    0.39
     perspective
    0.38
     perspectives
    0.28
     Perspective
    0.27
     outset
    0.27
     depths
    0.24
     viewpoint
    0.23
     comfort
    0.22
    pers
    0.22
     beginning
    0.22
    Act Density 0.101%

    No Known Activations