INDEX
    Explanations

    phrases indicating relationships or connections between entities

    New Auto-Interp
    Negative Logits
    Further
    -0.69
     Further
    -0.68
    Additional
    -0.64
     Additional
    -0.63
    further
    -0.63
     further
    -0.60
    additional
    -0.55
     FURTHER
    -0.52
     weiteren
    -0.49
     vidare
    -0.48
    POSITIVE LOGITS
     ano
    0.81
     anot
    0.75
    Ano
    0.72
     Ano
    0.69
     noDo
    0.56
     ant
    0.55
     anu
    0.54
     ANO
    0.52
    mother
    0.50
    ########.
    0.49
    Act Density 0.133%

    No Known Activations