INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     صحيح
    -0.85
    nikami
    -0.79
     doctrina
    -0.75
     welding
    -0.75
     осталось
    -0.71
     drastically
    -0.71
     །
    -0.69
     drivers
    -0.69
     riders
    -0.68
     zand
    -0.68
    POSITIVE LOGITS
     Trek
    0.98
     trek
    0.90
     Himalayan
    0.88
     Vertex
    0.87
     Nepal
    0.86
    Nepal
    0.86
     Sher
    0.86
     royaume
    0.85
     Everest
    0.84
     trekking
    0.82
    Act Density 0.005%

    No Known Activations