INDEX
    Explanations

    words related to existence or disbelief in various contexts

    New Auto-Interp
    Negative Logits
    hiba
    -0.73
    Thom
    -0.64
    edo
    -0.64
    Dro
    -0.62
    anche
    -0.61
    wer
    -0.61
     broom
    -0.61
    Ĺ
    -0.61
    ilde
    -0.60
    ney
    -0.60
    POSITIVE LOGITS
    entially
    1.05
    entials
    0.96
    places
    0.83
     within
    0.80
     existed
    0.79
    ential
    0.78
    nces
    0.78
    uate
    0.73
     solely
    0.73
     outside
    0.72
    Act Density 0.037%

    No Known Activations