INDEX
Explanations
legal or political terms and actions
references to legal or regulatory decisions
Negative Logits
ILCS              -0.68
inis              -0.65
Temper            -0.63
english           -0.60
TPPStreamerBot    -0.60
!.                -0.58
Rated             -0.58
Flavoring         -0.56
Osiris            -0.56
(@                -0.55
Positive Logits
amounted          0.96
proves            0.88
could             0.87
violates          0.84
stemmed           0.84
represents        0.83
shouldn           0.81
signifies         0.80
isn               0.79
constitutes       0.79
Activations Density: 0.444%