    Negative Logits
    Token               Logit
    " Hoyt"             -1.10
    " tantôt"           -0.89
    "roglo"             -0.88
    " sauvages"         -0.85
    "WEBPACK"           -0.83
    " Moulton"          -0.81
    " plais"            -0.81
    " Bakan"            -0.80
    " vincit"           -0.79
    " Wic"              -0.79
    Positive Logits
    Token               Logit
    " Infrastructure"   1.07
    " infrastructure"   1.05
    " infrastructures"  1.03
    "?>">"              0.92
    " Bridge"           0.91
    "Infrastructure"    0.90
    " BRIDGE"           0.87
    " bridge"           0.86
    "']))\n"            0.83
    " Infra"            0.82
    Activation Density: 0.143%

    No Known Activations