INDEX
Explanations
causes, becomes, settles, precipitates
Negative Logits
രിച്ചി  0.49
භාවිතා  0.49
讳  0.48
করেছেন  0.47
ചെയ്തി  0.46
භාවිත  0.45
bạn  0.45
explicitly  0.45
wyłącznie  0.45
вообще  0.44
Positive Logits
creates  0.75
begins  0.73
creates  0.67
начинает  0.66
becomes  0.66
triggers  0.64
becomes  0.61
comienzan  0.61
beginnt  0.59
trigger  0.58
Activations Density 0.204%