INDEX
    Explanations

    phrases indicating contrast or alternatives

    New Auto-Interp
    Negative Logits
    Portail
    -0.85
     "));
    -0.76
    '))
    
    -0.68
     GAO
    -0.67
     firebaseConfig
    -0.66
    }`
    -0.65
     
    -0.65
     ögon
    -0.64
    "));
    
    -0.63
     visor
    -0.63
    POSITIVE LOGITS
     Instead
    1.12
    Instead
    1.05
     instead
    1.01
    instead
    0.92
     Rather
    0.86
     rather
    0.82
    Rather
    0.78
    uttosto
    0.76
     Statt
    0.72
     statt
    0.68
    Act Density 0.148%

    No Known Activations