INDEX
Explanations
terms related to bending or curving actions
New Auto-Interp
Negative Logits
del
-0.51
Sal
-0.48
bodem
-0.48
Drawing
-0.47
Джерела
-0.44
カウン
-0.44
figyel
-0.43
leden
-0.43
loten
-0.43
drew
-0.43
POSITIVE LOGITS
bend
0.82
repos
0.74
Theſe
0.73
Bend
0.72
تانيه
0.71
Cæsar
0.71
bends
0.70
myſelf
0.69
bent
0.68
Huguen
0.68
Activations Density 2.319%