INDEX
Explanations
phrases related to the ease or difficulty of tasks and processes
difficulty and ease
New Auto-Interp
Negative Logits
Мексичка
-0.81
beſch
-0.78
المناصب
-0.78
deſſen
-0.75
surla
-0.75
geſch
-0.74
nakalista
-0.73
➟
-0.72
ſeines
-0.72
[@BOS@]
-0.71
POSITIVE LOGITS
due
0.37
because
0.36
greatly
0.34
He
0.33
easy
0.32
great
0.32
and
0.31
everywhere
0.31
easy
0.31
.
0.30
Activations Density 0.054%