INDEX
Explanations
instances of words related to law, investigation, and accountability
New Auto-Interp
Negative Logits
ovember
-0.90
nesota
-0.78
busters
-0.74
Phones
-0.73
artifacts
-0.71
dearly
-0.70
Stars
-0.69
izons
-0.69
ifles
-0.68
landish
-0.68
POSITIVE LOGITS
ly
0.96
ness
0.90
dismantling
0.90
bred
0.87
adherence
0.83
lobbying
0.81
understatement
0.81
enough
0.79
lly
0.77
examination
0.77
Activations Density 1.316%