    Negative Logits
    Token          Logit
    "TextSpan"     -0.82
    "ICT"          -0.82
    "agem"         -0.73
                   -0.72
    "Cpp"          -0.71
    " AST"         -0.71
    "лади"         -0.71
    " Karriere"    -0.70
    " zes"         -0.70
    "Jo"           -0.70
    Positive Logits
    Token          Logit
    " eats"        2.77
    " eating"      2.56
    " eat"         2.53
    " eaten"       2.17
    "eating"       2.02
    " Eating"      1.98
    " cannibal"    1.96
    "eat"          1.92
    "eats"         1.88
                   1.84
    Activation Density: 0.056%

    No Known Activations