INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     NC
    -0.07
     grounds
    -0.07
     //--------------------------------
    -0.07
     mansion
    -0.07
    "]);
    -0.07
     occurrence
    -0.06
    -house
    -0.06
     peux
    -0.06
    672
    -0.06
     parts
    -0.06
    POSITIVE LOGITS
     identify
    0.12
     identified
    0.11
     identifying
    0.10
     identifies
    0.09
    identify
    0.08
    Ident
    0.08
    ди
    0.08
    0.08
    ディ
    0.08
     Identity
    0.07
    Act Density 0.046%

    No Known Activations