INDEX
Explanations
phrases expressing tendencies or inclinations
tend to be or do
New Auto-Interp
Negative Logits
nię
-0.45
miljö
-0.42
//
-0.41
Comple
-0.41
escar
-0.40
erapa
-0.40
deko
-0.40
ailles
-0.40
Substitution
-0.40
Diſ
-0.40
POSITIVE LOGITS
tend
1.19
tends
1.16
tend
0.97
Tend
0.94
Tend
0.94
tended
0.93
tendency
0.91
tienden
0.87
tending
0.87
TEND
0.87
Activations Density 0.011%