INDEX
Explanations
phrases related to actions or concepts involving changes or modifications
New Auto-Interp
Negative Logits
Wage
-0.76
Flavoring
-0.65
Forward
-0.64
Teacher
-0.64
ĸļ
-0.63
ilial
-0.63
uyomi
-0.62
Ashe
-0.62
Showdown
-0.62
drawer
-0.60
POSITIVE LOGITS
facto
1.31
arest
1.27
cember
1.23
Blasio
1.22
activate
1.15
activated
1.13
vious
1.12
vel
1.09
utsche
1.05
jected
1.04
Activations Density 0.031%