INDEX
Explanations
phrases with person names
the character "ľ" appearing in various contexts
Negative Logits
favor        -0.94
favour       -0.93
spir         -0.87
territorial  -0.82
reflex       -0.80
targeted     -0.78
metic        -0.78
taboo        -0.77
carrier      -0.76
wagon        -0.75
Positive Logits
Because      1.56
They         1.54
It           1.46
And          1.46
But          1.45
That         1.43
He           1.43
Secondly     1.42
If           1.42
Otherwise    1.41
Activations Density: 0.124%