INDEX
Explanations
terms related to the legality or illegality of actions
phrases that discuss the legality or illegality of actions or behaviors
Negative Logits
ortun          -0.78
chet           -0.70
ience          -0.70
ocaly          -0.70
pend           -0.67
Pon            -0.66
frustrations   -0.65
tone           -0.65
aida           -0.63
ignt           -0.63
Positive Logits
punishable     0.93
legally        0.83
ifiable        0.73
anymore        0.71
SPONSORED      0.68
federally      0.68
ically         0.68
pedia          0.67
ALLY           0.67
adultery       0.66
Activations Density: 0.100%