INDEX
    Explanations

    mentions of time-related terms and phrases, particularly those referencing the present or recent past

    New Auto-Interp
    Negative Logits
     للاسماء
    -0.85
     nahilalakip
    -0.70
    jalá
    -0.67
    Personendaten
    -0.66
     comprends
    -0.66
    InjectAttribute
    -0.64
     venait
    -0.64
     خارجية
    -0.63
    ьаж
    -0.62
     Josephus
    -0.62
    POSITIVE LOGITS
     modern
    1.10
    modern
    0.93
     moderne
    0.92
    Nowadays
    0.92
    Modern
    0.86
     Modern
    0.85
    現代
    0.84
     nowadays
    0.83
     moderna
    0.81
     MODERN
    0.81
    Act Density 0.205%

    No Known Activations