INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ים
    1.06
    1.04
    ה
    0.96
    ות
    0.95
    0.95
    ک
    0.91
    ла
    0.91
    0.90
    נו
    0.88
    0.87
    POSITIVE LOGITS
     frío
    1.52
    Cold
    1.38
     frio
    1.38
    cold
    1.32
     cold
    1.27
    1.18
     colder
    1.13
     Cold
    1.12
     soğ
    1.11
    AY
    1.05
    Act Density 0.026%

    No Known Activations