INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    WebVitals
    -0.75
    ||}
    -0.71
    UnsafeEnabled
    -0.70
    ļi
    -0.65
    provoking
    -0.64
    paksa
    -0.63
    Eck
    -0.62
     Lippen
    -0.62
    álat
    -0.61
     Willen
    -0.61
    POSITIVE LOGITS
     hotels
    1.64
     hotel
    1.60
     Hotels
    1.54
    Hotels
    1.52
     HOTEL
    1.48
    hotel
    1.47
     Hotel
    1.46
    HOTEL
    1.39
    hotels
    1.30
     hote
    1.30
    Act Density 0.050%

    No Known Activations