INDEX
    Explanations

    references to interviews and discussions involving individuals

    New Auto-Interp
    Negative Logits
    insula
    -0.17
    anto
    -0.16
    agrant
    -0.15
     Doming
    -0.15
    izens
    -0.15
    ilarity
    -0.14
    apat
    -0.14
    ytt
    -0.14
    ov
    -0.14
    οÏħÏĤ
    -0.14
    POSITIVE LOGITS
    Ñħи
    0.15
    بط
    0.15
    _RD
    0.15
    hk
    0.15
     Baz
    0.14
    asin
    0.13
    alin
    0.13
     réfé
    0.13
    atori
    0.13
    raig
    0.13
    Act Density 0.114%

    No Known Activations