Explanations
infinitive forms of verbs indicating purpose or intent
Negative Logits
ording     -0.15
azen       -0.15
rippling   -0.14
roit       -0.14
avern      -0.14
IRT        -0.14
Ñī         -0.14
reme       -0.14
lets       -0.13
jk         -0.13
Positive Logits
constitution   0.15
wed            0.15
ssel           0.13
inka           0.13
вад            0.13
Reviewed       0.13
igel           0.13
hle            0.13
oust           0.12
juice          0.12
Activations Density: 0.074%