INDEX
    Explanations

    quotes and statements from individuals in the text

    Negative Logits
    wan         -0.19
    ugo         -0.15
     Hud        -0.15
    IGO         -0.14
     stor       -0.14
    tah         -0.14
    clusion     -0.14
    olor        -0.14
    ETCH        -0.14
    tas         -0.14

    Positive Logits
    apult       0.16
    IRM         0.15
    hir         0.15
    éIJĺ         0.14
    ÏĥÏĦε       0.14
    ULSE        0.14
    gil         0.14
    dle         0.14
    à¤ķन        0.14
    .Startup    0.13

    Activation Density: 0.028%

    No Known Activations