INDEX
    Explanations

    references to the United States and its governmental or geographical divisions

    New Auto-Interp
    Negative Logits
    InjectAttribute
    -0.68
     ModelExpression
    -0.52
    BASELINE
    -0.50
    enumi
    -0.47
    FXMLLoader
    -0.47
    CompilerServices
    -0.47
     relâche
    -0.46
     محفوظة
    -0.45
     councillors
    -0.43
    但她
    -0.42
    POSITIVE LOGITS
     Americans
    0.86
     America
    0.84
     AMERICA
    0.76
     america
    0.73
     ddelweddau
    0.72
     mankind
    0.70
     American
    0.70
     humankind
    0.70
    industan
    0.70
    Americans
    0.68
    Act Density 0.407%

    No Known Activations