INDEX
Explanations
legal and court case-related terms and phrases
New Auto-Interp
Negative Logits
cia
-1.01
asin
-0.99
ptoms
-0.96
bet
-0.95
phia
-0.95
hin
-0.91
listens
-0.91
ortment
-0.90
orthy
-0.88
icrobial
-0.87
POSITIVE LOGITS
ACLU
0.92
Bolivia
0.89
Scalia
0.89
gins
0.88
Heller
0.88
legalizing
0.86
Ply
0.85
Wyoming
0.82
EFF
0.82
FIRE
0.80
Activations Density 0.240%