INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    CloseOperation
    -0.79
     journey
    -0.63
     kontinu
    -0.60
     Journey
    -0.59
     createSlice
    -0.58
    Journey
    -0.57
     فريبيس
    -0.57
     חיצוניים
    -0.57
    MetaType
    -0.55
     antemano
    -0.54
    POSITIVE LOGITS
    principalTable
    0.55
    men
    0.54
    grine
    0.54
    nen
    0.53
    rode
    0.53
    ted
    0.52
    🏽
    0.52
    towane
    0.51
    s
    0.50
    den
    0.50
    Act Density 0.140%

    No Known Activations