INDEX
Explanations
mentions of time-related terms and phrases, particularly those referencing the present or recent past
Negative Logits
للاسماء    -0.85
nahilalakip    -0.70
jalá    -0.67
Personendaten    -0.66
comprends    -0.66
InjectAttribute    -0.64
venait    -0.64
خارجية    -0.63
ьаж    -0.62
Josephus    -0.62
Positive Logits
modern    1.10
modern    0.93
moderne    0.92
Nowadays    0.92
Modern    0.86
Modern    0.85
現代    0.84
nowadays    0.83
moderna    0.81
MODERN    0.81
Activations Density 0.205%