INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    ny
    0.98
     mandible
    0.92
    0.90
    nik
    0.86
    ky
    0.86
    kc
    0.85
    0.84
    0.84
    DK
    0.84
     оптима
    0.83
    POSITIVE LOGITS
     meet
    0.74
     vezi
    0.74
     সাস
    0.73
    ]},
    0.72
     lift
    0.70
     Mess
    0.70
     வில
    0.69
     Subsidi
    0.69
     вър
    0.69
     prezzi
    0.68
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.