    Negative Logits
    Token                 Logit
    " knowledge"          -1.06
    " Knowledge"          -0.91
    "knowledge"           -0.82
    "Knowledge"           -0.79
    " KNOWLEDGE"          -0.79
    " Perſ"               -0.71
    "ſelf"                -0.69
    "ſelves"              -0.66
    " houſe"              -0.63
    " Theſe"              -0.63
    Positive Logits
    Token                 Logit
    "الحياه"              0.59
    " of"                 0.59
    "rzost"               0.57
    "щадь"                0.57
    " Audiodateien"       0.56
    "WithIOException"     0.56
    "aarrggbb"            0.56
    "MigrationBuilder"    0.56
    "adpleegd"            0.55
    "henswürdigkeiten"    0.54
    Activation Density: 0.141%

    No Known Activations