INDEX
Explanations
references to artistic or creative work
New Auto-Interp
Negative Logits
ویکیپدیا
-0.56
ajevo
-0.54
OCITY
-0.54
scénario
-0.53
Обо
-0.51
conducive
-0.51
LookAnd
-0.51
vesse
-0.49
ʁ
-0.49
ígenes
-0.48
POSITIVE LOGITS
work
1.38
trabalho
0.90
Arbeit
0.87
trabajo
0.84
arbeit
0.84
works
0.84
arbete
0.83
oredCriteria
0.81
работы
0.80
arbejde
0.80
Activations Density 0.285%