INDEX
Explanations
phrases indicating the cessation or end of a belief or practice
New Auto-Interp
Negative Logits
BindView
-0.65
hibli
-0.62
iseite
-0.59
Hentet
-0.56
للمعارف
-0.54
compliment
-0.54
Hola
-0.53
kozó
-0.53
igång
-0.51
Davie
-0.51
POSITIVE LOGITS
不再
0.98
longer
0.97
artık
0.90
longer
0.89
насељу
0.87
anymore
0.86
Теперь
0.84
Теперь
0.84
Longer
0.82
now
0.79
Activations Density 0.092%