INDEX
    Explanations

    references to various cities in different contexts

    New Auto-Interp
    Negative Logits
    ftagPool
    -0.84
    }],
    
    -0.76
    })()
    -0.75
     '\\;'
    -0.70
    }]
    
    -0.69
    prüche
    -0.68
    ISupport
    -0.68
    icação
    -0.67
     Watanabe
    -0.66
    ]<<"
    -0.66
    POSITIVE LOGITS
     cities
    1.99
     Cities
    1.98
    Cities
    1.74
    cities
    1.66
     CITIES
    1.65
     Städte
    1.34
     Städten
    1.29
     ciudades
    1.17
     villes
    1.14
     городов
    1.09
    Act Density 0.101%

    No Known Activations