INDEX
Explanations
phrases related to government intervention and regulation
New Auto-Interp
Negative Logits
688
-0.17
uci
-0.16
ält
-0.15
622
-0.15
inya
-0.15
669
-0.14
avax
-0.14
uai
-0.14
852
-0.13
ebek
-0.13
POSITIVE LOGITS
intr
0.30
intrusive
0.29
intervention
0.27
interference
0.26
regulation
0.25
intrusion
0.25
Intervention
0.25
centralized
0.24
Intr
0.23
meddling
0.23
Activations Density 0.173%