INDEX
    Explanations

    The word "reality" and words that can be contrasted with it, such as "everywhere" or "tomorrow"

    Negative Logits
    Logit   Token
    -1.29   "])↵
    -1.28    ویکی‌پدیای
    -1.27    varandra
    -1.26    itſelf
    -1.25    للمعارف
    -1.22   vician
    -1.21   ']")
    -1.20   $.↵
    -1.20   .)}
    -1.20   ".↵
    Positive Logits
    Logit   Token
    0.78   .
    0.67    form
    0.63   ,
    0.62    dom
    0.61    di
    0.61    pos
    0.60    som
    0.59    em
    0.58    sal
    0.57   \
    Activation Density: 1.519%

    No Known Activations