INDEX
    Explanations
    Negative Logits
    Token       Logit
    ivated      -0.83
    liness      -0.77
     Beir       -0.74
    ivating     -0.72
    acles       -0.70
     oun        -0.69
    eness       -0.68
     migr       -0.65
    anse        -0.65
    acle        -0.65
    Positive Logits
    Token       Logit
    COMPLE      0.97
    ERROR       0.91
    NEW         0.88
    ••          0.88
    NOT         0.87
    Insert      0.86
    UPDATE      0.84
    WARNING     0.83
    NOTE        0.81
    TEXT        0.81
    Act Density 0.316%

    No Known Activations