INDEX
Explanations
terms related to legal actions and their consequences
New Auto-Interp
Negative Logits
wako
-0.46
للمعارف
-0.43
Tab
-0.43
引起的
-0.43
noDo
-0.42
AnchorStyles
-0.40
Tab
-0.40
Topping
-0.40
tev
-0.40
melding
-0.40
POSITIVE LOGITS
hurt
0.81
hurts
0.74
disadvantage
0.72
negatively
0.72
penal
0.71
harm
0.70
hurting
0.66
adversely
0.65
harmed
0.64
hob
0.63
Activations Density 0.615%