INDEX
Explanations
words related to evaluating the consequences of actions or conditions, particularly in terms of their benefits or detriments
New Auto-Interp
Negative Logits
LabelTagHelper
-0.61
zonego
-0.48
escar
-0.47
arran
-0.46
betweenstory
-0.46
AndEndTag
-0.46
etra
-0.45
Falun
-0.45
Esperanto
-0.45
pau
-0.45
POSITIVE LOGITS
beneficial
0.73
brainly
0.72
impact
0.69
effects
0.66
geweest
0.66
impact
0.65
ائص
0.64
effect
0.63
Effects
0.62
detriment
0.61
Activations Density 0.582%