INDEX
Explanations
words related to dates and people, often in a context involving events or actions
Negative Logits
s    -0.95
springfox    -0.68
Kob    -0.68
Wikimédia    -0.68
യും    -0.66
色んな    -0.66
はコチラ    -0.64
xic    -0.63
Filmo    -0.62
tartalomajánló    -0.61
Positive Logits
日    1.27
子    1.06
者    1.02
事    1.02
기    1.01
力    0.99
人    0.94
家    0.92
物    0.91
자    0.91
Activations Density: 0.092%