INDEX
    Explanations

    phrases that denote definitions or classifications

    New Auto-Interp
    Negative Logits
     estoppel
    -0.44
     morte
    -0.43
    Siddhartha
    -0.43
    ']))
    
    -0.42
     atheism
    -0.42
     acrobat
    -0.39
     bü
    -0.38
     shook
    -0.38
     чере
    -0.38
    timo
    -0.37
    POSITIVE LOGITS
    HasForeignKey
    0.90
     sogenannte
    0.89
     called
    0.89
     sogenannten
    0.86
    called
    0.85
    NUMX
    0.85
     Called
    0.84
    Called
    0.82
     termed
    0.81
    extAlignment
    0.76
    Act Density 0.560%

    No Known Activations