INDEX
    Explanations
    NEGATIVE LOGITS
        " हासिल"  0.75
        "र्सन"  0.72
        " motorsport"  0.69
        " kvalit"  0.68
        "increases"  0.66
        "ER"  0.66
        " strives"  0.66
        " publishes"  0.66
        " langfrist"  0.66
        "9"  0.65
    POSITIVE LOGITS
        " trembling"  0.73
        " الماء"  0.63
        " with"  0.59
        " softly"  0.59
        " cubierta"  0.58
        (blank)  0.57
        " debajo"  0.57
        " Kepala"  0.57
        " довольно"  0.55
        (blank)  0.55
    Activation Density: 0.885%

    No Known Activations