INDEX
Explanations
phrases that indicate actions or processes associated with orders and instructions
New Auto-Interp
Negative Logits
Jefus
-1.46
pleaſure
-1.44
Efq
-1.40
houſe
-1.39
Monfieur
-1.37
myſelf
-1.36
Majefty
-1.33
Theſe
-1.32
faſt
-1.32
themſelves
-1.30
POSITIVE LOGITS
afin
0.88
inorder
0.83
أجل
0.81
чтобы
0.77
是为了
0.74
to
0.73
upang
0.71
kében
0.70
Чтобы
0.70
כדי
0.70
Activations Density 0.071%