Explanations
terms related to legal matters and regulations
instances of the word "isc" and its variations, likely indicating a focus on discrimination or similar terms
Negative Logits
Despair    -0.70
Siberian   -0.69
Stard      -0.67
¥µ         -0.66
Seymour    -0.65
worldly    -0.63
BOOK       -0.63
Yug        -0.63
Khe        -0.62
chest      -0.60
Positive Logits
onduct     1.20
ount       1.10
isc        1.08
urnal      0.96
uity       0.90
otal       0.89
retion     0.89
ayne       0.86
ussion     0.85
ot         0.85
Activations Density: 0.008%