INDEX
Explanations
dates, particularly in January
New Auto-Interp
Negative Logits
łgorzata
-0.69
مرئيه
-0.69
Điều
-0.68
schirm
-0.67
бло
-0.66
Morde
-0.64
hamdu
-0.63
曖昧さ回避
-0.63
Ambro
-0.62
ervlak
-0.62
POSITIVE LOGITS
January
1.86
January
1.68
JANUARY
1.57
Jan
1.56
Jan
1.55
january
1.55
Januar
1.51
JAN
1.51
january
1.50
JAN
1.50
Activations Density 0.068%