INDEX
Explanations
instances of laws, regulations, and government-related terms and actions
New Auto-Interp
Negative Logits
eteen
-0.77
stones
-0.72
Cruel
-0.69
'';
-0.66
âĢİ
-0.65
hart
-0.65
itiz
-0.65
igators
-0.64
":-
-0.63
bats
-0.63
POSITIVE LOGITS
rightfully
0.81
rightly
0.77
unsuccessfully
0.75
indeed
0.72
preferably
0.72
tolerated
0.71
admittedly
0.69
nown
0.69
albeit
0.67
optionally
0.67
Activations Density 0.099%