INDEX
Explanations
verbs describing actions or effects
New Auto-Interp
Negative Logits
done
0.93
Done
0.88
Done
0.84
DONE
0.84
rispondere
0.79
DONE
0.77
interag
0.76
बेसिकली
0.76
done
0.76
yapılır
0.75
POSITIVE LOGITS
فراموش
0.84
employ
0.82
conlleva
0.80
affords
0.79
uphold
0.78
undertake
0.78
entail
0.78
possess
0.77
ளிட
0.77
inherited
0.77
Activations Density 0.273%